Overview
Dataset statistics
| Number of variables | 153 |
|---|---|
| Number of observations | 338094 |
| Missing cells | 26940749 |
| Missing cells (%) | 52.1% |
| Total size in memory | 394.7 MiB |
| Average record size in memory | 1.2 KiB |
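The derived figures in this table can be checked from the raw counts. The following is a minimal sketch, assuming that the missing-cell percentage is taken over variables times observations and that the average record size is the total size divided by the number of observations; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 153
n_observations = 338_094
missing_cells = 26_940_749
total_size_mib = 394.7

# Missing cells (%) = missing cells / (variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size = total size / observations, converted from MiB to KiB.
avg_record_kib = total_size_mib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_kib:.1f} KiB")
```

Rounded to one decimal place, both results agree with the values reported in the table.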
Variable types
| Text | 153 |
|---|---|
Dataset
| Description | NMNH Material Samples (USNM) 0049394-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.ycwxgd |
Alerts
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
institutionID has constant value "http://grbio.org/cool/142r-0w94" | Constant |
datasetName has constant value "NMNH Material Samples (USNM)" | Constant |
basisOfRecord has constant value "MATERIAL_SAMPLE" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
organismName has constant value "EML" | Constant |
organismScope has constant value "2024-12-01T12:07:33.811Z" | Constant |
associatedOrganisms has constant value "2024-12-01T11:07:21.711Z" | Constant |
previousIdentifications has constant value "true" | Constant |
materialEntityRemarks has constant value "false" | Constant |
parentEventID has constant value "Panama" | Constant |
eventType has constant value "PAN.5_1" | Constant |
eventTime has constant value "Pinogana" | Constant |
identifiedByID has constant value "ACCEPTED" | Constant |
identificationVerificationStatus has constant value "26098c25-8f7f-4c71-97ac-1d3db181c65e" | Constant |
identificationRemarks has constant value "US" | Constant |
acceptedNameUsage has constant value "false" | Constant |
subtribe has constant value "EML" | Constant |
subgenus has constant value "true" | Constant |
protocol has constant value "EML" | Constant |
lastCrawled has constant value "2024-12-01T11:07:21.711Z" | Constant |
publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
catalogNumber has 70677 (20.9%) missing values | Missing |
recordNumber has 181582 (53.7%) missing values | Missing |
recordedBy has 70120 (20.7%) missing values | Missing |
individualCount has 39347 (11.6%) missing values | Missing |
sex has 265741 (78.6%) missing values | Missing |
lifeStage has 209004 (61.8%) missing values | Missing |
preparations has 251111 (74.3%) missing values | Missing |
associatedSequences has 305424 (90.3%) missing values | Missing |
occurrenceRemarks has 193547 (57.2%) missing values | Missing |
organismName has 338093 (> 99.9%) missing values | Missing |
organismScope has 338093 (> 99.9%) missing values | Missing |
associatedOrganisms has 338093 (> 99.9%) missing values | Missing |
previousIdentifications has 338093 (> 99.9%) missing values | Missing |
materialEntityRemarks has 338093 (> 99.9%) missing values | Missing |
verbatimLabel has 338089 (> 99.9%) missing values | Missing |
materialSampleID has 84986 (25.1%) missing values | Missing |
eventID has 338092 (> 99.9%) missing values | Missing |
parentEventID has 338093 (> 99.9%) missing values | Missing |
eventType has 338093 (> 99.9%) missing values | Missing |
fieldNumber has 267153 (79.0%) missing values | Missing |
eventDate has 16903 (5.0%) missing values | Missing |
eventTime has 338093 (> 99.9%) missing values | Missing |
startDayOfYear has 19910 (5.9%) missing values | Missing |
endDayOfYear has 19910 (5.9%) missing values | Missing |
year has 17140 (5.1%) missing values | Missing |
month has 22792 (6.7%) missing values | Missing |
day has 52010 (15.4%) missing values | Missing |
verbatimEventDate has 235843 (69.8%) missing values | Missing |
habitat has 302025 (89.3%) missing values | Missing |
locationID has 284620 (84.2%) missing values | Missing |
higherGeography has 4531 (1.3%) missing values | Missing |
continent has 57738 (17.1%) missing values | Missing |
waterBody has 231346 (68.4%) missing values | Missing |
islandGroup has 315374 (93.3%) missing values | Missing |
island has 279260 (82.6%) missing values | Missing |
countryCode has 11127 (3.3%) missing values | Missing |
stateProvince has 66137 (19.6%) missing values | Missing |
county has 140475 (41.5%) missing values | Missing |
locality has 34045 (10.1%) missing values | Missing |
verbatimElevation has 322170 (95.3%) missing values | Missing |
verbatimDepth has 336615 (99.6%) missing values | Missing |
minimumDistanceAboveSurfaceInMeters has 338092 (> 99.9%) missing values | Missing |
decimalLatitude has 73462 (21.7%) missing values | Missing |
decimalLongitude has 73462 (21.7%) missing values | Missing |
coordinateUncertaintyInMeters has 327083 (96.7%) missing values | Missing |
pointRadiusSpatialFit has 338090 (> 99.9%) missing values | Missing |
verbatimCoordinateSystem has 329029 (97.3%) missing values | Missing |
georeferencedBy has 338090 (> 99.9%) missing values | Missing |
georeferenceProtocol has 255273 (75.5%) missing values | Missing |
georeferenceRemarks has 328595 (97.2%) missing values | Missing |
latestEonOrHighestEonothem has 338090 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 338090 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 338090 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 338091 (> 99.9%) missing values | Missing |
latestPeriodOrHighestSystem has 338091 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 338090 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 338090 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 338090 (> 99.9%) missing values | Missing |
member has 338091 (> 99.9%) missing values | Missing |
verbatimIdentification has 338090 (> 99.9%) missing values | Missing |
identificationQualifier has 333028 (98.5%) missing values | Missing |
typeStatus has 331537 (98.1%) missing values | Missing |
identifiedBy has 226045 (66.9%) missing values | Missing |
identifiedByID has 338090 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 338090 (> 99.9%) missing values | Missing |
identificationRemarks has 338090 (> 99.9%) missing values | Missing |
taxonID has 338090 (> 99.9%) missing values | Missing |
scientificNameID has 338092 (> 99.9%) missing values | Missing |
acceptedNameUsageID has 6111 (1.8%) missing values | Missing |
namePublishedInID has 338090 (> 99.9%) missing values | Missing |
acceptedNameUsage has 338090 (> 99.9%) missing values | Missing |
parentNameUsage has 338090 (> 99.9%) missing values | Missing |
originalNameUsage has 338090 (> 99.9%) missing values | Missing |
nameAccordingTo has 338090 (> 99.9%) missing values | Missing |
namePublishedIn has 338090 (> 99.9%) missing values | Missing |
namePublishedInYear has 338091 (> 99.9%) missing values | Missing |
higherClassification has 5891 (1.7%) missing values | Missing |
phylum has 6808 (2.0%) missing values | Missing |
class has 52277 (15.5%) missing values | Missing |
order has 30344 (9.0%) missing values | Missing |
superfamily has 338091 (> 99.9%) missing values | Missing |
family has 19906 (5.9%) missing values | Missing |
subfamily has 338090 (> 99.9%) missing values | Missing |
subtribe has 338090 (> 99.9%) missing values | Missing |
genus has 34392 (10.2%) missing values | Missing |
genericName has 34393 (10.2%) missing values | Missing |
subgenus has 338090 (> 99.9%) missing values | Missing |
specificEpithet has 89523 (26.5%) missing values | Missing |
infraspecificEpithet has 328999 (97.3%) missing values | Missing |
cultivarEpithet has 338090 (> 99.9%) missing values | Missing |
verbatimTaxonRank has 338090 (> 99.9%) missing values | Missing |
vernacularName has 338090 (> 99.9%) missing values | Missing |
nomenclaturalCode has 338090 (> 99.9%) missing values | Missing |
taxonomicStatus has 6108 (1.8%) missing values | Missing |
nomenclaturalStatus has 338091 (> 99.9%) missing values | Missing |
taxonRemarks has 338091 (> 99.9%) missing values | Missing |
elevation has 248950 (73.6%) missing values | Missing |
elevationAccuracy has 284393 (84.1%) missing values | Missing |
depth has 262666 (77.7%) missing values | Missing |
depthAccuracy has 272182 (80.5%) missing values | Missing |
distanceFromCentroidInMeters has 335404 (99.2%) missing values | Missing |
issue has 45626 (13.5%) missing values | Missing |
mediaType has 324090 (95.9%) missing values | Missing |
acceptedTaxonKey has 6112 (1.8%) missing values | Missing |
phylumKey has 6812 (2.0%) missing values | Missing |
classKey has 52277 (15.5%) missing values | Missing |
orderKey has 30347 (9.0%) missing values | Missing |
familyKey has 19910 (5.9%) missing values | Missing |
genusKey has 34396 (10.2%) missing values | Missing |
speciesKey has 89520 (26.5%) missing values | Missing |
species has 89520 (26.5%) missing values | Missing |
acceptedScientificName has 6112 (1.8%) missing values | Missing |
verbatimScientificName has 24039 (7.1%) missing values | Missing |
typifiedName has 338060 (> 99.9%) missing values | Missing |
repatriated has 10837 (3.2%) missing values | Missing |
gbifRegion has 12220 (3.6%) missing values | Missing |
level0Gid has 157741 (46.7%) missing values | Missing |
level0Name has 157741 (46.7%) missing values | Missing |
level1Gid has 158980 (47.0%) missing values | Missing |
level1Name has 158980 (47.0%) missing values | Missing |
level2Gid has 167237 (49.5%) missing values | Missing |
level2Name has 167249 (49.5%) missing values | Missing |
level3Gid has 300258 (88.8%) missing values | Missing |
level3Name has 300562 (88.9%) missing values | Missing |
iucnRedListCategory has 63456 (18.8%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
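The three alert types listed here (Constant, Missing, Unique) can be illustrated with a small sketch. The rules below are assumptions inferred from the alert list, for example that a column with a single distinct non-missing value is flagged Constant even when most cells are missing, as with organismName. The thresholds the profiling tool actually uses are not shown in this report, and `column_alerts` and the `recordNumber` toy values are hypothetical.

```python
from typing import Optional, Sequence


def column_alerts(values: Sequence[Optional[str]]) -> list[str]:
    """Return alert labels for one column; None marks a missing cell."""
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    distinct = set(present)

    alerts = []
    # Assumed rule: exactly one distinct non-missing value.
    if len(distinct) == 1:
        alerts.append("Constant")
    # Assumed rule: any missing cell at all triggers the alert.
    if n_missing > 0:
        alerts.append("Missing")
    # Assumed rule: no missing cells and every value distinct.
    if n_missing == 0 and len(present) > 1 and len(distinct) == len(present):
        alerts.append("Unique")
    return alerts


# Toy columns; only the license, organismName and gbifID values come from the report.
toy = {
    "license": ["CC0_1_0", "CC0_1_0", "CC0_1_0"],
    "organismName": [None, None, "EML"],
    "gbifID": ["4501677301", "3027962301", "3028050301"],
    "recordNumber": ["A1", None, "A2"],
}

for name, values in toy.items():
    print(name, column_alerts(values))
```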
Reproduction
| Analysis started | 2025-01-08 22:42:19.861325 |
|---|---|
| Analysis finished | 2025-01-08 22:42:37.312514 |
| Duration | 17.45 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
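The reported duration follows from the two timestamps. A minimal sketch, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" table.
started = datetime.strptime("2025-01-08 22:42:19.861325", FMT)
finished = datetime.strptime("2025-01-08 22:42:37.312514", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```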
Variables
gbifID
Text
Unique 
| Distinct | 338094 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 338094 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 4501677301 |
|---|---|
| 2nd row | 3027962301 |
| 3rd row | 3028050301 |
| 4th row | 3027962302 |
| 5th row | 3028050302 |
| Value | Count | Frequency (%) |
| 4501677301 | 1 | < 0.1% |
| 4909491303 | 1 | < 0.1% |
| 3357130301 | 1 | < 0.1% |
| 3027962303 | 1 | < 0.1% |
| 3758404301 | 1 | < 0.1% |
| 3027962304 | 1 | < 0.1% |
| 3336913301 | 1 | < 0.1% |
| 3028050303 | 1 | < 0.1% |
| 4909491307 | 1 | < 0.1% |
| 3028050304 | 1 | < 0.1% |
| Other values (338084) | 338084 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 540984 | 16.0% |
| 3 | 466213 | 13.8% |
| 9 | 357116 | 10.6% |
| 2 | 356645 | 10.5% |
| 8 | 331017 | 9.8% |
| 4 | 317355 | 9.4% |
| 1 | 298199 | 8.8% |
| 5 | 263435 | 7.8% |
| 7 | 254191 | 7.5% |
| 6 | 195785 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3380940 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 540984 | 16.0% |
| 3 | 466213 | 13.8% |
| 9 | 357116 | 10.6% |
| 2 | 356645 | 10.5% |
| 8 | 331017 | 9.8% |
| 4 | 317355 | 9.4% |
| 1 | 298199 | 8.8% |
| 5 | 263435 | 7.8% |
| 7 | 254191 | 7.5% |
| 6 | 195785 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3380940 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 540984 | 16.0% |
| 3 | 466213 | 13.8% |
| 9 | 357116 | 10.6% |
| 2 | 356645 | 10.5% |
| 8 | 331017 | 9.8% |
| 4 | 317355 | 9.4% |
| 1 | 298199 | 8.8% |
| 5 | 263435 | 7.8% |
| 7 | 254191 | 7.5% |
| 6 | 195785 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3380940 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 540984 | 16.0% |
| 3 | 466213 | 13.8% |
| 9 | 357116 | 10.6% |
| 2 | 356645 | 10.5% |
| 8 | 331017 | 9.8% |
| 4 | 317355 | 9.4% |
| 1 | 298199 | 8.8% |
| 5 | 263435 | 7.8% |
| 7 | 254191 | 7.5% |
| 6 | 195785 | 5.8% |
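The character, category and script tables in this section are all counts over the concatenated column values. As a rough illustration (not the profiling tool's own code), the sketch below tallies characters and Unicode general categories for the five sample gbifID values; the assumption is that the report's "Decimal Number" label corresponds to the Unicode category `Nd`.

```python
import unicodedata
from collections import Counter

# The five sample gbifID values shown in this section.
samples = ["4501677301", "3027962301", "3028050301", "3027962302", "3028050302"]

# Count every character across all values.
char_counts = Counter("".join(samples))
# Count Unicode general categories ("Nd" = decimal number).
category_counts = Counter(unicodedata.category(ch) for s in samples for ch in s)
total = sum(char_counts.values())

for ch, n in char_counts.most_common(3):
    print(f"{ch!r}: {n} ({100 * n / total:.1f}%)")
print(dict(category_counts))
```

Because every value is ten ASCII digits, the per-category, per-script and per-block tables repeat the same distribution.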
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 338094 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 676188 | 28.6% |
| 0 | 676188 | 28.6% |
| _ | 676188 | 28.6% |
| 1 | 338094 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1014282 | 42.9% |
| Uppercase Letter | 676188 | 28.6% |
| Connector Punctuation | 676188 | 28.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 676188 | 66.7% |
| 1 | 338094 | 33.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 676188 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 676188 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1690470 | 71.4% |
| Latin | 676188 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 676188 | 40.0% |
| _ | 676188 | 40.0% |
| 1 | 338094 | 20.0% |
Latin
| Value | Count | Frequency (%) |
| C | 676188 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2366658 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 676188 | 28.6% |
| 0 | 676188 | 28.6% |
| _ | 676188 | 28.6% |
| 1 | 338094 | 14.3% |
modified
Text
| Distinct | 10795 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 2110 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 2024-06-26T12:37:00Z |
|---|---|
| 2nd row | 2021-10-14T09:12:00Z |
| 3rd row | 2022-07-20T16:25:00Z |
| 4th row | 2021-10-13T15:49:00Z |
| 5th row | 2019-06-25T16:21:00Z |
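Although the profiler types this column as Text, the sample values are fixed-width UTC timestamps. The sketch below parses the five samples, assuming every value follows the same 20-character `YYYY-MM-DDTHH:MM:SSZ` pattern; the helper name `parse_modified` is illustrative only.

```python
from datetime import datetime, timezone

# The five sample values shown in this section.
samples = [
    "2024-06-26T12:37:00Z",
    "2021-10-14T09:12:00Z",
    "2022-07-20T16:25:00Z",
    "2021-10-13T15:49:00Z",
    "2019-06-25T16:21:00Z",
]


def parse_modified(value: str) -> datetime:
    """Parse a timestamp such as 2024-06-26T12:37:00Z as timezone-aware UTC."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


parsed = [parse_modified(s) for s in samples]
print(min(parsed).isoformat(), max(parsed).isoformat())
```

Parsing the column this way would let it be profiled as a date-time variable rather than as free text.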
| Value | Count | Frequency (%) |
| 2023-06-13t09:52:00z | 2840 | 0.8% |
| 2024-10-17t11:06:00z | 2662 | 0.8% |
| 2021-10-13t15:50:00z | 2652 | 0.8% |
| 2021-10-13t15:49:00z | 2517 | 0.7% |
| 2022-10-17t16:14:00z | 2414 | 0.7% |
| 2022-10-17t16:13:00z | 2368 | 0.7% |
| 2021-10-14t09:09:00z | 2340 | 0.7% |
| 2021-10-14t09:10:00z | 2235 | 0.7% |
| 2021-10-14t09:08:00z | 2151 | 0.6% |
| 2021-10-13t15:48:00z | 2042 | 0.6% |
| Other values (10785) | 313873 | 92.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1616584 | 23.9% |
| 2 | 1048026 | 15.5% |
| 1 | 802624 | 11.9% |
| - | 676188 | 10.0% |
| : | 676188 | 10.0% |
| T | 338094 | 5.0% |
| Z | 338094 | 5.0% |
| 3 | 230104 | 3.4% |
| 4 | 213568 | 3.2% |
| 5 | 202272 | 3.0% |
| Other values (4) | 620138 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4733316 | 70.0% |
| Dash Punctuation | 676188 | 10.0% |
| Other Punctuation | 676188 | 10.0% |
| Uppercase Letter | 676188 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1616584 | 34.2% |
| 2 | 1048026 | 22.1% |
| 1 | 802624 | 17.0% |
| 3 | 230104 | 4.9% |
| 4 | 213568 | 4.5% |
| 5 | 202272 | 4.3% |
| 7 | 196522 | 4.2% |
| 6 | 170692 | 3.6% |
| 9 | 151865 | 3.2% |
| 8 | 101059 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 338094 | 50.0% |
| Z | 338094 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 676188 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 676188 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6085692 | 90.0% |
| Latin | 676188 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1616584 | 26.6% |
| 2 | 1048026 | 17.2% |
| 1 | 802624 | 13.2% |
| - | 676188 | 11.1% |
| : | 676188 | 11.1% |
| 3 | 230104 | 3.8% |
| 4 | 213568 | 3.5% |
| 5 | 202272 | 3.3% |
| 7 | 196522 | 3.2% |
| 6 | 170692 | 2.8% |
| Other values (2) | 252924 | 4.2% |
Latin
| Value | Count | Frequency (%) |
| T | 338094 | 50.0% |
| Z | 338094 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6761880 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1616584 | 23.9% |
| 2 | 1048026 | 15.5% |
| 1 | 802624 | 11.9% |
| - | 676188 | 10.0% |
| : | 676188 | 10.0% |
| T | 338094 | 5.0% |
| Z | 338094 | 5.0% |
| 3 | 230104 | 3.4% |
| 4 | 213568 | 3.2% |
| 5 | 202272 | 3.0% |
| Other values (4) | 620138 | 9.2% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 338094 | 14.3% |
| museum | 338094 | 14.3% |
| of | 338094 | 14.3% |
| natural | 338094 | 14.3% |
| history | 338094 | 14.3% |
| smithsonian | 338094 | 14.3% |
| institution | 338094 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2366658 | 11.9% |
| i | 2028564 | 10.2% |
| (space) | 2028564 | 10.2% |
| a | 1690470 | 8.5% |
| o | 1690470 | 8.5% |
| n | 1690470 | 8.5% |
| s | 1352376 | 6.8% |
| u | 1352376 | 6.8% |
| r | 676188 | 3.4% |
| m | 676188 | 3.4% |
| Other values (11) | 4395222 | 22.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15552324 | 78.0% |
| Space Separator | 2028564 | 10.2% |
| Uppercase Letter | 2028564 | 10.2% |
| Other Punctuation | 338094 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2366658 | 15.2% |
| i | 2028564 | 13.0% |
| a | 1690470 | 10.9% |
| o | 1690470 | 10.9% |
| n | 1690470 | 10.9% |
| s | 1352376 | 8.7% |
| u | 1352376 | 8.7% |
| r | 676188 | 4.3% |
| m | 676188 | 4.3% |
| l | 676188 | 4.3% |
| Other values (4) | 1352376 | 8.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 676188 | 33.3% |
| M | 338094 | 16.7% |
| H | 338094 | 16.7% |
| S | 338094 | 16.7% |
| I | 338094 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2028564 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 338094 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17580888 | 88.1% |
| Common | 2366658 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2366658 | 13.5% |
| i | 2028564 | 11.5% |
| a | 1690470 | 9.6% |
| o | 1690470 | 9.6% |
| n | 1690470 | 9.6% |
| s | 1352376 | 7.7% |
| u | 1352376 | 7.7% |
| r | 676188 | 3.8% |
| m | 676188 | 3.8% |
| N | 676188 | 3.8% |
| Other values (9) | 3380940 | 19.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 2028564 | 85.7% |
| , | 338094 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19947546 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 2366658 | 11.9% |
| i | 2028564 | 10.2% |
| (space) | 2028564 | 10.2% |
| a | 1690470 | 8.5% |
| o | 1690470 | 8.5% |
| n | 1690470 | 8.5% |
| s | 1352376 | 6.8% |
| u | 1352376 | 6.8% |
| r | 676188 | 3.4% |
| m | 676188 | 3.4% |
| Other values (11) | 4395222 | 22.0% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 31 |
| Mean length | 31 |
| Min length | 31 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | http://grbio.org/cool/142r-0w94 |
|---|---|
| 2nd row | http://grbio.org/cool/142r-0w94 |
| 3rd row | http://grbio.org/cool/142r-0w94 |
| 4th row | http://grbio.org/cool/142r-0w94 |
| 5th row | http://grbio.org/cool/142r-0w94 |
| Value | Count | Frequency (%) |
| http://grbio.org/cool/142r-0w94 | 338094 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 1352376 | 12.9% |
| o | 1352376 | 12.9% |
| r | 1014282 | 9.7% |
| g | 676188 | 6.5% |
| t | 676188 | 6.5% |
| 4 | 676188 | 6.5% |
| h | 338094 | 3.2% |
| 1 | 338094 | 3.2% |
| w | 338094 | 3.2% |
| 0 | 338094 | 3.2% |
| Other values (10) | 3380940 | 32.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6085692 | 58.1% |
| Other Punctuation | 2028564 | 19.4% |
| Decimal Number | 2028564 | 19.4% |
| Dash Punctuation | 338094 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1352376 | 22.2% |
| r | 1014282 | 16.7% |
| g | 676188 | 11.1% |
| t | 676188 | 11.1% |
| h | 338094 | 5.6% |
| w | 338094 | 5.6% |
| l | 338094 | 5.6% |
| c | 338094 | 5.6% |
| i | 338094 | 5.6% |
| b | 338094 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 676188 | 33.3% |
| 1 | 338094 | 16.7% |
| 0 | 338094 | 16.7% |
| 2 | 338094 | 16.7% |
| 9 | 338094 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1352376 | 66.7% |
| . | 338094 | 16.7% |
| : | 338094 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 338094 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6085692 | 58.1% |
| Common | 4395222 | 41.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1352376 | 22.2% |
| r | 1014282 | 16.7% |
| g | 676188 | 11.1% |
| t | 676188 | 11.1% |
| h | 338094 | 5.6% |
| w | 338094 | 5.6% |
| l | 338094 | 5.6% |
| c | 338094 | 5.6% |
| i | 338094 | 5.6% |
| b | 338094 | 5.6% |
Common
| Value | Count | Frequency (%) |
| / | 1352376 | 30.8% |
| 4 | 676188 | 15.4% |
| 1 | 338094 | 7.7% |
| 0 | 338094 | 7.7% |
| - | 338094 | 7.7% |
| 2 | 338094 | 7.7% |
| . | 338094 | 7.7% |
| : | 338094 | 7.7% |
| 9 | 338094 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10480914 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 1352376 | 12.9% |
| o | 1352376 | 12.9% |
| r | 1014282 | 9.7% |
| g | 676188 | 6.5% |
| t | 676188 | 6.5% |
| 4 | 676188 | 6.5% |
| h | 338094 | 3.2% |
| 1 | 338094 | 3.2% |
| w | 338094 | 3.2% |
| 0 | 338094 | 3.2% |
| Other values (10) | 3380940 | 32.3% |
collectionID
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
|---|---|
| 2nd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 3rd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 4th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 5th row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| Value | Count | Frequency (%) |
| urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad | 119032 | 35.2% |
| urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 | 74362 | 22.0% |
| urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 | 42251 | 12.5% |
| urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f | 41564 | 12.3% |
| urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 | 28248 | 8.4% |
| urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 | 24486 | 7.2% |
| urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 | 8151 | 2.4% |
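Each collectionID is a UUID written in URN form, which explains the constant length of 45 characters (the 9-character `urn:uuid:` prefix plus a 36-character UUID). A minimal sketch using Python's standard `uuid` module, whose constructor accepts the URN prefix, applied to three of the values listed above:

```python
import uuid

# Three of the seven distinct collectionID values listed in this section.
samples = [
    "urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad",
    "urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6",
    "urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8",
]

# uuid.UUID strips the "urn:uuid:" prefix before parsing the hex digits.
parsed = [uuid.UUID(s) for s in samples]

for original, value in zip(samples, parsed):
    # value.urn rebuilds the URN form, so a round trip validates the string.
    print(len(original), value.version, value.urn == original)
```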
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1352376 | 8.9% |
| d | 1154135 | 7.6% |
| c | 1039600 | 6.8% |
| u | 1014282 | 6.7% |
| 8 | 916661 | 6.0% |
| 0 | 796356 | 5.2% |
| a | 774527 | 5.1% |
| 1 | 740909 | 4.9% |
| 9 | 705119 | 4.6% |
| : | 676188 | 4.4% |
| Other values (12) | 6044077 | 39.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6795599 | 44.7% |
| Decimal Number | 6390067 | 42.0% |
| Dash Punctuation | 1352376 | 8.9% |
| Other Punctuation | 676188 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1154135 | 17.0% |
| c | 1039600 | 15.3% |
| u | 1014282 | 14.9% |
| a | 774527 | 11.4% |
| f | 673171 | 9.9% |
| b | 653347 | 9.6% |
| e | 472255 | 6.9% |
| i | 338094 | 5.0% |
| r | 338094 | 5.0% |
| n | 338094 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 916661 | 14.3% |
| 0 | 796356 | 12.5% |
| 1 | 740909 | 11.6% |
| 9 | 705119 | 11.0% |
| 3 | 620084 | 9.7% |
| 2 | 619077 | 9.7% |
| 6 | 590600 | 9.2% |
| 4 | 581803 | 9.1% |
| 7 | 477790 | 7.5% |
| 5 | 341668 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1352376 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 676188 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8418631 | 55.3% |
| Latin | 6795599 | 44.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1352376 | 16.1% |
| 8 | 916661 | 10.9% |
| 0 | 796356 | 9.5% |
| 1 | 740909 | 8.8% |
| 9 | 705119 | 8.4% |
| : | 676188 | 8.0% |
| 3 | 620084 | 7.4% |
| 2 | 619077 | 7.4% |
| 6 | 590600 | 7.0% |
| 4 | 581803 | 6.9% |
| Other values (2) | 819458 | 9.7% |
Latin
| Value | Count | Frequency (%) |
| d | 1154135 | 17.0% |
| c | 1039600 | 15.3% |
| u | 1014282 | 14.9% |
| a | 774527 | 11.4% |
| f | 673171 | 9.9% |
| b | 653347 | 9.6% |
| e | 472255 | 6.9% |
| i | 338094 | 5.0% |
| r | 338094 | 5.0% |
| n | 338094 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15214230 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1352376 | 8.9% |
| d | 1154135 | 7.6% |
| c | 1039600 | 6.8% |
| u | 1014282 | 6.7% |
| 8 | 916661 | 6.0% |
| 0 | 796356 | 5.2% |
| a | 774527 | 5.1% |
| 1 | 740909 | 4.9% |
| 9 | 705119 | 4.6% |
| : | 676188 | 4.4% |
| Other values (12) | 6044077 | 39.7% |
institutionCode
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.750063592 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | US |
| Value | Count | Frequency (%) |
| usnm | 295843 | 87.5% |
| us | 42251 | 12.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 338094 | 26.7% |
| S | 338094 | 26.7% |
| N | 295843 | 23.3% |
| M | 295843 | 23.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1267874 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 338094 | 26.7% |
| S | 338094 | 26.7% |
| N | 295843 | 23.3% |
| M | 295843 | 23.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1267874 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 338094 | 26.7% |
| S | 338094 | 26.7% |
| N | 295843 | 23.3% |
| M | 295843 | 23.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1267874 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 338094 | 26.7% |
| S | 338094 | 26.7% |
| N | 295843 | 23.3% |
| M | 295843 | 23.3% |
collectionCode
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 2.982215005 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ENT |
|---|---|
| 2nd row | IZ |
| 3rd row | IZ |
| 4th row | IZ |
| 5th row | US |
| Value | Count | Frequency (%) |
| ent | 119032 | 35.2% |
| iz | 74362 | 22.0% |
| us | 42251 | 12.5% |
| fish | 41564 | 12.3% |
| herp | 28248 | 8.4% |
| mamm | 24486 | 7.2% |
| birds | 8151 | 2.4% |
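The mean length reported for this column can be recovered from the value table alone, since every row contributes the length of its single code. A minimal sketch follows; `mean_length` is an illustrative helper, and the institutionCode counts are copied from that variable's own value table.

```python
def mean_length(value_counts: dict[str, int]) -> float:
    """Mean string length of a column, given each distinct value and its count."""
    total_chars = sum(len(value) * count for value, count in value_counts.items())
    total_rows = sum(value_counts.values())
    return total_chars / total_rows


# Counts copied from the collectionCode value table.
collection_code_counts = {
    "ent": 119032,
    "iz": 74362,
    "us": 42251,
    "fish": 41564,
    "herp": 28248,
    "mamm": 24486,
    "birds": 8151,
}

# Counts copied from the institutionCode value table.
institution_code_counts = {"usnm": 295843, "us": 42251}

print(f"collectionCode mean length: {mean_length(collection_code_counts):.9f}")
print(f"institutionCode mean length: {mean_length(institution_code_counts):.9f}")
```

The same weighted sum gives the total character count shown in the "Most occurring blocks" table for each column.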
Most occurring characters
| Value | Count | Frequency (%) |
| E | 147280 | 14.6% |
| I | 124077 | 12.3% |
| N | 119032 | 11.8% |
| T | 119032 | 11.8% |
| S | 91966 | 9.1% |
| Z | 74362 | 7.4% |
| M | 73458 | 7.3% |
| H | 69812 | 6.9% |
| U | 42251 | 4.2% |
| F | 41564 | 4.1% |
| Other values (5) | 105435 | 10.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1008269 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 147280 | 14.6% |
| I | 124077 | 12.3% |
| N | 119032 | 11.8% |
| T | 119032 | 11.8% |
| S | 91966 | 9.1% |
| Z | 74362 | 7.4% |
| M | 73458 | 7.3% |
| H | 69812 | 6.9% |
| U | 42251 | 4.2% |
| F | 41564 | 4.1% |
| Other values (5) | 105435 | 10.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1008269 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 147280 | 14.6% |
| I | 124077 | 12.3% |
| N | 119032 | 11.8% |
| T | 119032 | 11.8% |
| S | 91966 | 9.1% |
| Z | 74362 | 7.4% |
| M | 73458 | 7.3% |
| H | 69812 | 6.9% |
| U | 42251 | 4.2% |
| F | 41564 | 4.1% |
| Other values (5) | 105435 | 10.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1008269 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 147280 | 14.6% |
| I | 124077 | 12.3% |
| N | 119032 | 11.8% |
| T | 119032 | 11.8% |
| S | 91966 | 9.1% |
| Z | 74362 | 7.4% |
| M | 73458 | 7.3% |
| H | 69812 | 6.9% |
| U | 42251 | 4.2% |
| F | 41564 | 4.1% |
| Other values (5) | 105435 | 10.5% |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 28 |
| Mean length | 28 |
| Min length | 28 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Material Samples (USNM) |
|---|---|
| 2nd row | NMNH Material Samples (USNM) |
| 3rd row | NMNH Material Samples (USNM) |
| 4th row | NMNH Material Samples (USNM) |
| 5th row | NMNH Material Samples (USNM) |
| Value | Count | Frequency (%) |
| nmnh | 338094 | 25.0% |
| material | 338094 | 25.0% |
| samples | 338094 | 25.0% |
| usnm | 338094 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1014282 | 10.7% |
| (space) | 1014282 | 10.7% |
| a | 1014282 | 10.7% |
| M | 1014282 | 10.7% |
| e | 676188 | 7.1% |
| l | 676188 | 7.1% |
| S | 676188 | 7.1% |
| p | 338094 | 3.6% |
| U | 338094 | 3.6% |
| ( | 338094 | 3.6% |
| Other values (7) | 2366658 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4395222 | 46.4% |
| Uppercase Letter | 3380940 | 35.7% |
| Space Separator | 1014282 | 10.7% |
| Open Punctuation | 338094 | 3.6% |
| Close Punctuation | 338094 | 3.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1014282 | 23.1% |
| e | 676188 | 15.4% |
| l | 676188 | 15.4% |
| p | 338094 | 7.7% |
| s | 338094 | 7.7% |
| i | 338094 | 7.7% |
| m | 338094 | 7.7% |
| r | 338094 | 7.7% |
| t | 338094 | 7.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1014282 | 30.0% |
| M | 1014282 | 30.0% |
| S | 676188 | 20.0% |
| U | 338094 | 10.0% |
| H | 338094 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1014282 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 338094 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 338094 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7776162 | 82.1% |
| Common | 1690470 | 17.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1014282 | 13.0% |
| a | 1014282 | 13.0% |
| M | 1014282 | 13.0% |
| e | 676188 | 8.7% |
| l | 676188 | 8.7% |
| S | 676188 | 8.7% |
| p | 338094 | 4.3% |
| U | 338094 | 4.3% |
| s | 338094 | 4.3% |
| i | 338094 | 4.3% |
| Other values (4) | 1352376 | 17.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 1014282 | 60.0% |
| ( | 338094 | 20.0% |
| ) | 338094 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9466632 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1014282 | 10.7% |
| (space) | 1014282 | 10.7% |
| a | 1014282 | 10.7% |
| M | 1014282 | 10.7% |
| e | 676188 | 7.1% |
| l | 676188 | 7.1% |
| S | 676188 | 7.1% |
| p | 338094 | 3.6% |
| U | 338094 | 3.6% |
| ( | 338094 | 3.6% |
| Other values (7) | 2366658 | 25.0% |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MATERIAL_SAMPLE |
|---|---|
| 2nd row | MATERIAL_SAMPLE |
| 3rd row | MATERIAL_SAMPLE |
| 4th row | MATERIAL_SAMPLE |
| 5th row | MATERIAL_SAMPLE |
| Value | Count | Frequency (%) |
| material_sample | 338094 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1014282 | 20.0% |
| M | 676188 | 13.3% |
| E | 676188 | 13.3% |
| L | 676188 | 13.3% |
| T | 338094 | 6.7% |
| R | 338094 | 6.7% |
| I | 338094 | 6.7% |
| _ | 338094 | 6.7% |
| S | 338094 | 6.7% |
| P | 338094 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4733316 | 93.3% |
| Connector Punctuation | 338094 | 6.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1014282 | 21.4% |
| M | 676188 | 14.3% |
| E | 676188 | 14.3% |
| L | 676188 | 14.3% |
| T | 338094 | 7.1% |
| R | 338094 | 7.1% |
| I | 338094 | 7.1% |
| S | 338094 | 7.1% |
| P | 338094 | 7.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 338094 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4733316 | |
| Common | 338094 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1014282 | |
| M | 676188 | |
| E | 676188 | |
| L | 676188 | |
| T | 338094 | 7.1% |
| R | 338094 | 7.1% |
| I | 338094 | 7.1% |
| S | 338094 | 7.1% |
| P | 338094 | 7.1% |
Common
| Value | Count | Frequency (%) |
| _ | 338094 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5071410 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1014282 | |
| M | 676188 | |
| E | 676188 | |
| L | 676188 | |
| T | 338094 | 6.7% |
| R | 338094 | 6.7% |
| I | 338094 | 6.7% |
| _ | 338094 | 6.7% |
| S | 338094 | 6.7% |
| P | 338094 | 6.7% |
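
Because basisOfRecord is constant, the character counts above can be reproduced from the single value: each character's total is its number of occurrences in `MATERIAL_SAMPLE` multiplied by the number of observations. The following is a minimal standard-library sketch of that arithmetic. The helper name `char_counts` is illustrative and is not part of the profiling tool.

```python
from collections import Counter


def char_counts(value: str, n_rows: int) -> dict[str, int]:
    """Character totals for a column in which every row holds `value`."""
    per_value = Counter(value)  # occurrences of each character in one value
    return {ch: k * n_rows for ch, k in per_value.items()}


# Figures taken from the report: one constant value, 338094 observations.
counts = char_counts("MATERIAL_SAMPLE", 338094)
total = sum(counts.values())

print(counts["A"], total)                    # count for "A" and the ASCII total
print(round(100 * counts["A"] / total, 1))   # share of "A" in percent
```

The same reasoning applies to any column flagged as Constant, such as occurrenceStatus.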
occurrenceID
Text
Unique 
| Distinct | 338094 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 338094 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/300028c5f-ea1d-4c01-9253-09524fc57db6 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/30006cd83-36b3-4629-86db-f5a28307189f |
| 3rd row | http://n2t.net/ark:/65665/30007a443-7a0a-49a9-9c54-cae1342160a6 |
| 4th row | http://n2t.net/ark:/65665/300098b69-426b-451c-a675-27a1b7bb5b60 |
| 5th row | http://n2t.net/ark:/65665/3000a9424-501b-43e7-a337-ee632a8fa9d0 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/300028c5f-ea1d-4c01-9253-09524fc57db6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3000ef5c5-8164-4ad4-b093-79821f58ace8 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300114e18-4d31-4558-acc1-47ce8dd8940c | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300119514-9afd-4342-83ae-3526ac40f20f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300154f73-1f7a-4d73-8c43-7c6d66c03b0f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30015c5b5-263e-4d28-916f-89728207dfda | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3001878d3-3d26-4b66-9ad5-77d6938de137 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300187c30-1f5e-4401-a208-4e42206dc341 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300193d42-6a2a-41b9-b203-29e571953cd6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3001b5554-545c-479e-a09c-f732f7e77413 | 1 | < 0.1% |
| Other values (338084) | 338084 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 1690470 | 7.9% |
| 6 | 1647936 | 7.7% |
| - | 1352376 | 6.3% |
| t | 1352376 | 6.3% |
| 5 | 1309841 | 6.1% |
| a | 1056185 | 5.0% |
| 4 | 972781 | 4.6% |
| 3 | 972418 | 4.6% |
| 2 | 971788 | 4.6% |
| e | 970508 | 4.6% |
| Other values (16) | 9003243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9213538 | |
| Lowercase Letter | 8029256 | |
| Other Punctuation | 2704752 | 12.7% |
| Dash Punctuation | 1352376 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1352376 | |
| a | 1056185 | |
| e | 970508 | |
| b | 718089 | |
| n | 676188 | |
| f | 634963 | |
| c | 634809 | |
| d | 633762 | |
| k | 338094 | 4.2% |
| r | 338094 | 4.2% |
| Other values (2) | 676188 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1647936 | |
| 5 | 1309841 | |
| 4 | 972781 | |
| 3 | 972418 | |
| 2 | 971788 | |
| 9 | 718905 | |
| 8 | 717477 | |
| 0 | 634479 | 6.9% |
| 1 | 633985 | 6.9% |
| 7 | 633928 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1690470 | |
| : | 676188 | 25.0% |
| . | 338094 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1352376 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13270666 | |
| Latin | 8029256 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 1690470 | |
| 6 | 1647936 | |
| - | 1352376 | |
| 5 | 1309841 | |
| 4 | 972781 | |
| 3 | 972418 | |
| 2 | 971788 | |
| 9 | 718905 | 5.4% |
| 8 | 717477 | 5.4% |
| : | 676188 | 5.1% |
| Other values (4) | 2240486 |
Latin
| Value | Count | Frequency (%) |
| t | 1352376 | |
| a | 1056185 | |
| e | 970508 | |
| b | 718089 | |
| n | 676188 | |
| f | 634963 | |
| c | 634809 | |
| d | 633762 | |
| k | 338094 | 4.2% |
| r | 338094 | 4.2% |
| Other values (2) | 676188 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21299922 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 1690470 | 7.9% |
| 6 | 1647936 | 7.7% |
| - | 1352376 | 6.3% |
| t | 1352376 | 6.3% |
| 5 | 1309841 | 6.1% |
| a | 1056185 | 5.0% |
| 4 | 972781 | 4.6% |
| 3 | 972418 | 4.6% |
| 2 | 971788 | 4.6% |
| e | 970508 | 4.6% |
| Other values (16) | 9003243 |
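
Every occurrenceID has length 63, and the sample rows share one shape: the `http://n2t.net/ark:/65665/` prefix, a literal `3`, and a lowercase UUID. The sketch below checks a value against that shape. The pattern is inferred from the five sample rows only, so it is an assumption rather than a documented rule, and the function name `looks_like_usnm_ark` is hypothetical.

```python
import re

# Assumed pattern, inferred from the sample rows in this report:
# ARK prefix (26 chars) + "3" + UUID in 8-4-4-4-12 form (36 chars) = 63 chars.
ARK_PATTERN = re.compile(
    r"http://n2t\.net/ark:/65665/3"
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"
)


def looks_like_usnm_ark(identifier: str) -> bool:
    """True if the whole string matches the assumed ARK + UUID shape."""
    return ARK_PATTERN.fullmatch(identifier) is not None


sample = "http://n2t.net/ark:/65665/300028c5f-ea1d-4c01-9253-09524fc57db6"
print(looks_like_usnm_ark(sample), len(sample))
```

The shape is consistent with the character table: four hyphens and five slashes per value give 1352376 and 1690470 across 338094 rows.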
catalogNumber
Text
Missing 
| Distinct | 225831 |
|---|---|
| Distinct (%) | 84.4% |
| Missing | 70677 |
| Missing (%) | 20.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 14.08573127 |
| Min length | 9 |
Unique
| Unique | 192785 |
|---|---|
| Unique (%) | 72.1% |
Sample
| 1st row | USNMENT00976719.2 |
|---|---|
| 2nd row | USNM 1566725 |
| 3rd row | USNM 1430312 |
| 4th row | USNM 1477111 |
| 5th row | USNMENT01646520 |
| Value | Count | Frequency (%) |
| usnm | 146196 | |
| herp | 7474 | 1.7% |
| tissue | 7183 | 1.6% |
| us | 2191 | 0.5% |
| | 2187 | 0.5% |
| lot | 2187 | 0.5% |
| wet | 2187 | 0.5% |
| image | 291 | 0.1% |
| 594492 | 64 | < 0.1% |
| 1487948 | 58 | < 0.1% |
| Other values (223433) | 267295 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 384256 | 10.2% |
| 1 | 338698 | 9.0% |
| 0 | 282088 | 7.5% |
| S | 267418 | 7.1% |
| U | 267417 | 7.1% |
| M | 265226 | 7.0% |
| 4 | 250702 | 6.7% |
| 6 | 201321 | 5.3% |
| 3 | 187391 | 5.0% |
| 2 | 174893 | 4.6% |
| Other values (26) | 1147354 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1989649 | |
| Uppercase Letter | 1437341 | |
| Space Separator | 169896 | 4.5% |
| Other Punctuation | 95068 | 2.5% |
| Lowercase Letter | 72623 | 1.9% |
| Dash Punctuation | 2187 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17135 | |
| s | 14366 | |
| p | 7474 | |
| r | 7474 | |
| i | 7183 | |
| u | 7183 | |
| t | 4374 | 6.0% |
| w | 2187 | 3.0% |
| l | 2187 | 3.0% |
| o | 2187 | 3.0% |
| Other values (3) | 873 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 384256 | |
| S | 267418 | |
| U | 267417 | |
| M | 265226 | |
| T | 126214 | 8.8% |
| E | 119030 | 8.3% |
| H | 7474 | 0.5% |
| I | 291 | < 0.1% |
| A | 14 | < 0.1% |
| R | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 338698 | |
| 0 | 282088 | |
| 4 | 250702 | |
| 6 | 201321 | |
| 3 | 187391 | |
| 2 | 174893 | |
| 5 | 167336 | |
| 9 | 130800 | 6.6% |
| 7 | 128397 | 6.5% |
| 8 | 128023 | 6.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 169896 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 95068 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2187 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2256800 | |
| Latin | 1509964 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 384256 | |
| S | 267418 | |
| U | 267417 | |
| M | 265226 | |
| T | 126214 | 8.4% |
| E | 119030 | 7.9% |
| e | 17135 | 1.1% |
| s | 14366 | 1.0% |
| p | 7474 | 0.5% |
| r | 7474 | 0.5% |
| Other values (13) | 33954 | 2.2% |
Common
| Value | Count | Frequency (%) |
| 1 | 338698 | |
| 0 | 282088 | |
| 4 | 250702 | |
| 6 | 201321 | |
| 3 | 187391 | |
| 2 | 174893 | |
| (space) | 169896 | |
| 5 | 167336 | |
| 9 | 130800 | 5.8% |
| 7 | 128397 | 5.7% |
| Other values (3) | 225278 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3766764 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 384256 | 10.2% |
| 1 | 338698 | 9.0% |
| 0 | 282088 | 7.5% |
| S | 267418 | 7.1% |
| U | 267417 | 7.1% |
| M | 265226 | 7.0% |
| 4 | 250702 | 6.7% |
| 6 | 201321 | 5.3% |
| 3 | 187391 | 5.0% |
| 2 | 174893 | 4.6% |
| Other values (26) | 1147354 |
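
The catalogNumber figures suggest that Missing (%) is taken over all observations, while Distinct (%) and Unique (%) are taken over the non-missing values only. The sketch below reproduces the reported percentages under that reading. It is a check of the arithmetic, not a description of the profiling tool's internals, and the helper `pct` is illustrative.

```python
N_ROWS = 338094    # observations in the dataset
MISSING = 70677    # catalogNumber rows with no value
DISTINCT = 225831  # different values present
UNIQUE = 192785    # values that occur exactly once
TOTAL_CHARS = 3766764  # ASCII character total reported for the column

present = N_ROWS - MISSING  # rows that actually hold a catalog number


def pct(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place, as shown in the report."""
    return round(100 * part / whole, 1)


print(present)
print(pct(MISSING, N_ROWS))    # relative to all rows
print(pct(DISTINCT, present))  # relative to non-missing rows
print(pct(UNIQUE, present))    # relative to non-missing rows
print(TOTAL_CHARS / present)   # mean length over non-missing rows
```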
recordNumber
Text
Missing 
| Distinct | 102948 |
|---|---|
| Distinct (%) | 65.8% |
| Missing | 181582 |
| Missing (%) | 53.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 53 |
| Mean length | 8.259181405 |
| Min length | 1 |
Unique
| Unique | 67344 |
|---|---|
| Unique (%) | 43.0% |
Sample
| 1st row | T548-A9-TW19 |
|---|---|
| 2nd row | BMOO-09792 |
| 3rd row | JC3629 |
| 4th row | 707 |
| 5th row | mbio988 |
| Value | Count | Frequency (%) |
| blz | 5367 | 2.9% |
| d&ml | 4441 | 2.4% |
| | 1570 | 0.8% |
| tag | 1340 | 0.7% |
| tree | 1340 | 0.7% |
| flmoo | 1323 | 0.7% |
| blb | 1217 | 0.6% |
| sms | 1215 | 0.6% |
| bah | 989 | 0.5% |
| tob | 834 | 0.4% |
| Other values (93496) | 168604 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 122723 | 9.5% |
| 2 | 92650 | 7.2% |
| 0 | 89200 | 6.9% |
| 3 | 72330 | 5.6% |
| - | 60851 | 4.7% |
| 5 | 57939 | 4.5% |
| 4 | 57585 | 4.5% |
| 6 | 53715 | 4.2% |
| 8 | 52782 | 4.1% |
| 7 | 52215 | 4.0% |
| Other values (66) | 580671 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 701177 | |
| Uppercase Letter | 421903 | |
| Dash Punctuation | 60866 | 4.7% |
| Lowercase Letter | 41237 | 3.2% |
| Space Separator | 31728 | 2.5% |
| Connector Punctuation | 19931 | 1.5% |
| Other Punctuation | 11779 | 0.9% |
| Close Punctuation | 2020 | 0.2% |
| Open Punctuation | 2020 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 37547 | 8.9% |
| B | 36981 | 8.8% |
| O | 31709 | 7.5% |
| M | 31417 | 7.4% |
| S | 27817 | 6.6% |
| A | 26249 | 6.2% |
| R | 24656 | 5.8% |
| T | 22619 | 5.4% |
| L | 20601 | 4.9% |
| E | 18889 | 4.5% |
| Other values (16) | 143418 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5194 | |
| i | 4265 | |
| a | 4119 | |
| b | 4004 | |
| o | 4001 | |
| r | 3388 | |
| m | 3332 | |
| l | 2843 | |
| s | 1665 | 4.0% |
| v | 1558 | 3.8% |
| Other values (15) | 6868 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 122723 | |
| 2 | 92650 | |
| 0 | 89200 | |
| 3 | 72330 | |
| 5 | 57939 | |
| 4 | 57585 | |
| 6 | 53715 | |
| 8 | 52782 | |
| 7 | 52215 | |
| 9 | 50038 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4691 | |
| & | 4583 | |
| # | 1512 | 12.8% |
| . | 919 | 7.8% |
| / | 49 | 0.4% |
| ? | 22 | 0.2% |
| : | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 60851 | |
| – | 15 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2006 | |
| ] | 14 | 0.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2006 | |
| [ | 14 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 31728 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19931 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 829521 | |
| Latin | 463140 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 37547 | 8.1% |
| B | 36981 | 8.0% |
| O | 31709 | 6.8% |
| M | 31417 | 6.8% |
| S | 27817 | 6.0% |
| A | 26249 | 5.7% |
| R | 24656 | 5.3% |
| T | 22619 | 4.9% |
| L | 20601 | 4.4% |
| E | 18889 | 4.1% |
| Other values (41) | 184655 |
Common
| Value | Count | Frequency (%) |
| 1 | 122723 | |
| 2 | 92650 | |
| 0 | 89200 | |
| 3 | 72330 | |
| - | 60851 | |
| 5 | 57939 | |
| 4 | 57585 | |
| 6 | 53715 | |
| 8 | 52782 | |
| 7 | 52215 | |
| Other values (15) | 117531 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1292646 | |
| Punctuation | 15 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 122723 | 9.5% |
| 2 | 92650 | 7.2% |
| 0 | 89200 | 6.9% |
| 3 | 72330 | 5.6% |
| - | 60851 | 4.7% |
| 5 | 57939 | 4.5% |
| 4 | 57585 | 4.5% |
| 6 | 53715 | 4.2% |
| 8 | 52782 | 4.1% |
| 7 | 52215 | 4.0% |
| Other values (65) | 580656 |
Punctuation
| Value | Count | Frequency (%) |
| – | 15 |
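
The category labels used in these tables (Decimal Number, Dash Punctuation, and so on) correspond to Unicode general categories. As a rough sketch, the breakdown can be approximated with Python's standard `unicodedata` module. The label mapping and the function name `category_counts` are illustrative assumptions, and the profiling tool may compute its categories differently.

```python
import unicodedata
from collections import Counter

# Subset of Unicode general-category codes mapped to the labels in this report.
CATEGORY_LABELS = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


# The five recordNumber sample rows shown in the report.
counts = category_counts(["T548-A9-TW19", "BMOO-09792", "JC3629", "707", "mbio988"])
print(counts.most_common())

# The en dash (U+2013) is Dash Punctuation too, but lies outside the ASCII block.
print(unicodedata.category("\u2013"), ord("\u2013") < 128)
```

This explains why the 15 en dashes appear under Dash Punctuation together with the hyphen, yet are listed in a separate block from ASCII.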
recordedBy
Text
Missing 
| Distinct | 8090 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 70120 |
| Missing (%) | 20.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 161 |
|---|---|
| Median length | 107 |
| Mean length | 24.15374253 |
| Min length | 1 |
Unique
| Unique | 912 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | R. Wielgus |
|---|---|
| 2nd row | R. Vrijenhoek |
| 3rd row | S. McPherson |
| 4th row | K. Crandall, H. Robinson, J. Buhay & A. Toon |
| 5th row | Tibet-MacArthur, D. A. Bell, V. A. Funk, S. Ge, Y. Meng, Z. Nie, R. Ree, J. Wen, J. Yue & W. Zuo |
| Value | Count | Frequency (%) |
| | 115458 | 8.9% |
| m | 70969 | 5.5% |
| j | 68929 | 5.3% |
| r | 47195 | 3.6% |
| d | 44002 | 3.4% |
| c | 43587 | 3.4% |
| s | 40805 | 3.1% |
| k | 35410 | 2.7% |
| l | 29135 | 2.2% |
| a | 28392 | 2.2% |
| Other values (5513) | 775991 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1031899 | |
| . | 564920 | 8.7% |
| e | 432034 | 6.7% |
| a | 359856 | 5.6% |
| n | 295498 | 4.6% |
| r | 285565 | 4.4% |
| i | 278605 | 4.3% |
| l | 261098 | 4.0% |
| o | 258914 | 4.0% |
| t | 195845 | 3.0% |
| Other values (73) | 2508341 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3375202 | |
| Uppercase Letter | 1202334 | 18.6% |
| Space Separator | 1031899 | 15.9% |
| Other Punctuation | 838090 | 12.9% |
| Dash Punctuation | 13922 | 0.2% |
| Decimal Number | 8798 | 0.1% |
| Close Punctuation | 1220 | < 0.1% |
| Open Punctuation | 1110 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 432034 | |
| a | 359856 | |
| n | 295498 | |
| r | 285565 | |
| i | 278605 | 8.3% |
| l | 261098 | 7.7% |
| o | 258914 | 7.7% |
| t | 195845 | 5.8% |
| s | 188962 | 5.6% |
| u | 120782 | 3.6% |
| Other values (27) | 698043 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 124557 | 10.4% |
| S | 91413 | 7.6% |
| C | 84834 | 7.1% |
| B | 82834 | 6.9% |
| R | 80131 | 6.7% |
| J | 77045 | 6.4% |
| P | 76310 | 6.3% |
| D | 68047 | 5.7% |
| L | 65318 | 5.4% |
| W | 57241 | 4.8% |
| Other values (17) | 394604 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2228 | |
| 1 | 2076 | |
| 2 | 2014 | |
| 0 | 1930 | |
| 8 | 370 | 4.2% |
| 6 | 94 | 1.1% |
| 4 | 84 | 1.0% |
| 3 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 564920 | |
| , | 154997 | 18.5% |
| & | 115454 | 13.8% |
| / | 2045 | 0.2% |
| ' | 674 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1010 | |
| ] | 210 | 17.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 900 | |
| [ | 210 | 18.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1031899 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13922 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4577536 | |
| Common | 1895039 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 432034 | 9.4% |
| a | 359856 | 7.9% |
| n | 295498 | 6.5% |
| r | 285565 | 6.2% |
| i | 278605 | 6.1% |
| l | 261098 | 5.7% |
| o | 258914 | 5.7% |
| t | 195845 | 4.3% |
| s | 188962 | 4.1% |
| M | 124557 | 2.7% |
| Other values (54) | 1896602 |
Common
| Value | Count | Frequency (%) |
| (space) | 1031899 | |
| . | 564920 | |
| , | 154997 | 8.2% |
| & | 115454 | 6.1% |
| - | 13922 | 0.7% |
| 9 | 2228 | 0.1% |
| 1 | 2076 | 0.1% |
| / | 2045 | 0.1% |
| 2 | 2014 | 0.1% |
| 0 | 1930 | 0.1% |
| Other values (9) | 3554 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6470585 | |
| None | 1990 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1031899 | |
| . | 564920 | 8.7% |
| e | 432034 | 6.7% |
| a | 359856 | 5.6% |
| n | 295498 | 4.6% |
| r | 285565 | 4.4% |
| i | 278605 | 4.3% |
| l | 261098 | 4.0% |
| o | 258914 | 4.0% |
| t | 195845 | 3.0% |
| Other values (61) | 2506351 |
None
| Value | Count | Frequency (%) |
| í | 1006 | |
| é | 487 | |
| ö | 156 | 7.8% |
| á | 138 | 6.9% |
| ó | 97 | 4.9% |
| Ç | 33 | 1.7% |
| ı | 33 | 1.7% |
| ñ | 21 | 1.1% |
| ú | 12 | 0.6% |
| ü | 3 | 0.2% |
| Other values (2) | 4 | 0.2% |
individualCount
Text
Missing 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 39347 |
| Missing (%) | 11.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000127198 |
| Min length | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 294711 | |
| 0 | 2658 | 0.9% |
| 4 | 440 | 0.1% |
| 2 | 363 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 226 | 0.1% |
| 10 | 26 | < 0.1% |
| 6 | 20 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| Other values (9) | 14 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 294745 | |
| 0 | 2688 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 368 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 298785 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 294745 | |
| 0 | 2688 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 368 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 298785 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 294745 | |
| 0 | 2688 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 368 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 298785 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 294745 | |
| 0 | 2688 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 368 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
sex
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 265741 |
| Missing (%) | 78.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 4 |
| Mean length | 4.888021229 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | MALE |
| 3rd row | MALE |
| 4th row | FEMALE |
| 5th row | MALE |
| Value | Count | Frequency (%) |
| male | 40483 | |
| female | 31797 | |
| hermaphrodite | 73 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 104223 | |
| M | 72353 | |
| A | 72353 | |
| L | 72280 | |
| F | 31797 | 9.0% |
| H | 146 | < 0.1% |
| R | 146 | < 0.1% |
| P | 73 | < 0.1% |
| O | 73 | < 0.1% |
| D | 73 | < 0.1% |
| Other values (2) | 146 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 353663 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 104223 | |
| M | 72353 | |
| A | 72353 | |
| L | 72280 | |
| F | 31797 | 9.0% |
| H | 146 | < 0.1% |
| R | 146 | < 0.1% |
| P | 73 | < 0.1% |
| O | 73 | < 0.1% |
| D | 73 | < 0.1% |
| Other values (2) | 146 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 353663 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 104223 | |
| M | 72353 | |
| A | 72353 | |
| L | 72280 | |
| F | 31797 | 9.0% |
| H | 146 | < 0.1% |
| R | 146 | < 0.1% |
| P | 73 | < 0.1% |
| O | 73 | < 0.1% |
| D | 73 | < 0.1% |
| Other values (2) | 146 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 353663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 104223 | |
| M | 72353 | |
| A | 72353 | |
| L | 72280 | |
| F | 31797 | 9.0% |
| H | 146 | < 0.1% |
| R | 146 | < 0.1% |
| P | 73 | < 0.1% |
| O | 73 | < 0.1% |
| D | 73 | < 0.1% |
| Other values (2) | 146 | < 0.1% |
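
The length statistics and the missing count for sex can be cross-checked against the value frequency table. The sketch below derives the non-missing count, the total character count, and the mean length from the three reported values. It assumes the mean is taken over non-missing rows, which matches the figures in this section.

```python
# Value counts from the report; the frequency table shows them lowercased,
# while the sample rows show the stored uppercase form.
value_counts = {"MALE": 40483, "FEMALE": 31797, "HERMAPHRODITE": 73}
N_ROWS = 338094  # observations in the dataset

present = sum(value_counts.values())          # rows with a value
missing = N_ROWS - present                    # rows without a value
total_chars = sum(len(v) * c for v, c in value_counts.items())
mean_length = total_chars / present           # mean over non-missing rows

print(present, missing, total_chars)
print(round(mean_length, 9))
print(max(len(v) for v in value_counts), min(len(v) for v in value_counts))
```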
lifeStage
Text
Missing 
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 209004 |
| Missing (%) | 61.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.126787513 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Adult |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 120982 | |
| juvenile | 3007 | 2.3% |
| larva | 1688 | 1.3% |
| flowering | 959 | 0.7% |
| unknown | 541 | 0.4% |
| subadult | 513 | 0.4% |
| eft | 308 | 0.2% |
| immature | 251 | 0.2% |
| veliger | 163 | 0.1% |
| fruiting | 134 | 0.1% |
| Other values (15) | 544 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 125703 | |
| u | 125581 | |
| t | 122330 | |
| d | 121514 | |
| A | 120982 | |
| e | 7872 | 1.2% |
| n | 5826 | 0.9% |
| v | 4696 | 0.7% |
| a | 4561 | 0.7% |
| i | 4472 | 0.7% |
| Other values (28) | 18280 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 532727 | |
| Uppercase Letter | 129090 | 19.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 125703 | |
| u | 125581 | |
| t | 122330 | |
| d | 121514 | |
| e | 7872 | 1.5% |
| n | 5826 | 1.1% |
| v | 4696 | 0.9% |
| a | 4561 | 0.9% |
| i | 4472 | 0.8% |
| r | 3201 | 0.6% |
| Other values (12) | 6971 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 120982 | |
| J | 3007 | 2.3% |
| L | 1688 | 1.3% |
| F | 1130 | 0.9% |
| U | 541 | 0.4% |
| S | 513 | 0.4% |
| E | 377 | 0.3% |
| I | 251 | 0.2% |
| V | 164 | 0.1% |
| P | 136 | 0.1% |
| Other values (6) | 301 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 661817 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 125703 | |
| u | 125581 | |
| t | 122330 | |
| d | 121514 | |
| A | 120982 | |
| e | 7872 | 1.2% |
| n | 5826 | 0.9% |
| v | 4696 | 0.7% |
| a | 4561 | 0.7% |
| i | 4472 | 0.7% |
| Other values (28) | 18280 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 661817 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 125703 | |
| u | 125581 | |
| t | 122330 | |
| d | 121514 | |
| A | 120982 | |
| e | 7872 | 1.2% |
| n | 5826 | 0.9% |
| v | 4696 | 0.7% |
| a | 4561 | 0.7% |
| i | 4472 | 0.7% |
| Other values (28) | 18280 | 2.8% |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 338094 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 676188 | |
| P | 338094 | |
| R | 338094 | |
| S | 338094 | |
| N | 338094 | |
| T | 338094 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2366658 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 676188 | |
| P | 338094 | |
| R | 338094 | |
| S | 338094 | |
| N | 338094 | |
| T | 338094 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2366658 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 676188 | |
| P | 338094 | |
| R | 338094 | |
| S | 338094 | |
| N | 338094 | |
| T | 338094 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2366658 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 676188 | |
| P | 338094 | |
| R | 338094 | |
| S | 338094 | |
| N | 338094 | |
| T | 338094 |
preparations
Text
Missing 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 251111 |
| Missing (%) | 74.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 142 |
|---|---|
| Median length | 6 |
| Mean length | 6.19215249 |
| Min length | 4 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Frozen |
|---|---|
| 2nd row | Frozen |
| 3rd row | Frozen |
| 4th row | Frozen |
| 5th row | Frozen |
| Value | Count | Frequency (%) |
| frozen | 72559 | |
| vial | 6698 | 7.3% |
| ethanol | 4918 | 5.4% |
| wet | 2268 | 2.5% |
| lot | 2268 | 2.5% |
| drained | 1063 | 1.2% |
| photograph | 626 | 0.7% |
| biorepository | 456 | 0.5% |
| alcohol | 197 | 0.2% |
| | 148 | 0.2% |
| Other values (11) | 295 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 82904 | |
| n | 78637 | |
| e | 76402 | |
| r | 75218 | |
| z | 72559 | |
| F | 72209 | |
| l | 14333 | 2.7% |
| a | 13317 | 2.5% |
| t | 10591 | 2.0% |
| i | 8773 | 1.6% |
| Other values (37) | 33669 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 447509 | |
| Uppercase Letter | 84882 | 15.8% |
| Space Separator | 4513 | 0.8% |
| Other Punctuation | 836 | 0.2% |
| Decimal Number | 296 | 0.1% |
| Open Punctuation | 197 | < 0.1% |
| Close Punctuation | 197 | < 0.1% |
| Dash Punctuation | 182 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 82904 | |
| n | 78637 | |
| e | 76402 | |
| r | 75218 | |
| z | 72559 | |
| l | 14333 | 3.2% |
| a | 13317 | 3.0% |
| t | 10591 | 2.4% |
| i | 8773 | 2.0% |
| h | 6367 | 1.4% |
| Other values (13) | 8408 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 72209 | |
| E | 4953 | 5.8% |
| V | 3863 | 4.6% |
| W | 2253 | 2.7% |
| P | 626 | 0.7% |
| B | 456 | 0.5% |
| A | 242 | 0.3% |
| D | 73 | 0.1% |
| L | 49 | 0.1% |
| S | 37 | < 0.1% |
| Other values (5) | 121 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 640 | |
| % | 148 | 17.7% |
| ' | 48 | 5.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 148 | |
| 5 | 148 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4513 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 197 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 197 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 182 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 532391 | |
| Common | 6221 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 82904 | |
| n | 78637 | |
| e | 76402 | |
| r | 75218 | |
| z | 72559 | |
| F | 72209 | |
| l | 14333 | 2.7% |
| a | 13317 | 2.5% |
| t | 10591 | 2.0% |
| i | 8773 | 1.6% |
| Other values (28) | 27448 | 5.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 4513 | |
| ; | 640 | 10.3% |
| ( | 197 | 3.2% |
| ) | 197 | 3.2% |
| - | 182 | 2.9% |
| 9 | 148 | 2.4% |
| % | 148 | 2.4% |
| 5 | 148 | 2.4% |
| ' | 48 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 538612 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 82904 | |
| n | 78637 | |
| e | 76402 | |
| r | 75218 | |
| z | 72559 | |
| F | 72209 | |
| l | 14333 | 2.7% |
| a | 13317 | 2.5% |
| t | 10591 | 2.0% |
| i | 8773 | 1.6% |
| Other values (37) | 33669 |
disposition
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.38328985 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | in collection |
|---|---|
| 2nd row | in collection |
| 3rd row | in collection |
| 4th row | in collection |
| 5th row | in collection |
| Value | Count | Frequency (%) |
| in | 298321 | |
| collection | 298321 | |
| consumed | 38009 | 6.0% |
| yes | 943 | 0.1% |
| no | 821 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 635472 | |
| o | 635472 | |
| c | 634651 | |
| i | 596642 | |
| l | 596642 | |
| e | 337273 | |
| (space) | 298321 | |
| t | 298321 | |
| s | 38952 | 0.9% |
| u | 38009 | 0.9% |
| Other values (3) | 76961 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3888395 | |
| Space Separator | 298321 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 635472 | |
| o | 635472 | |
| c | 634651 | |
| i | 596642 | |
| l | 596642 | |
| e | 337273 | |
| t | 298321 | |
| s | 38952 | 1.0% |
| u | 38009 | 1.0% |
| m | 38009 | 1.0% |
| Other values (2) | 38952 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 298321 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3888395 | |
| Common | 298321 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 635472 | |
| o | 635472 | |
| c | 634651 | |
| i | 596642 | |
| l | 596642 | |
| e | 337273 | |
| t | 298321 | |
| s | 38952 | 1.0% |
| u | 38009 | 1.0% |
| m | 38009 | 1.0% |
| Other values (2) | 38952 | 1.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 298321 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4186716 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 635472 | |
| o | 635472 | |
| c | 634651 | |
| i | 596642 | |
| l | 596642 | |
| e | 337273 | |
| (space) | 298321 | |
| t | 298321 | |
| s | 38952 | 0.9% |
| u | 38009 | 0.9% |
| Other values (3) | 76961 | 1.8% |
Missing 
| Distinct | 25139 |
|---|---|
| Distinct (%) | 76.9% |
| Missing | 305424 |
| Missing (%) | 90.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 206 |
|---|---|
| Median length | 8 |
| Mean length | 11.03008877 |
| Min length | 8 |
Unique
| Unique | 17608 |
|---|---|
| Unique (%) | 53.9% |
Sample
| 1st row | MW204230;MW124559 |
|---|---|
| 2nd row | MW982336 |
| 3rd row | MF785606;MF785913 |
| 4th row | MN344605 |
| 5th row | JQ840329 |
| Value | Count | Frequency (%) |
| mw983728 | 2 | < 0.1% |
| mg967848 | 2 | < 0.1% |
| mn344832 | 2 | < 0.1% |
| mw984278 | 2 | < 0.1% |
| mn345511 | 2 | < 0.1% |
| mw277828 | 2 | < 0.1% |
| mn344810 | 2 | < 0.1% |
| kt733332 | 2 | < 0.1% |
| mw277965 | 2 | < 0.1% |
| mg968024 | 2 | < 0.1% |
| Other values (25129) | 32650 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 34014 | 9.4% |
| 3 | 32561 | 9.0% |
| 4 | 30927 | 8.6% |
| 9 | 27656 | 7.7% |
| M | 27232 | 7.6% |
| 2 | 27065 | 7.5% |
| 7 | 23931 | 6.6% |
| 0 | 23180 | 6.4% |
| 5 | 21355 | 5.9% |
| 1 | 21111 | 5.9% |
| Other values (24) | 91321 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 261882 | |
| Uppercase Letter | 87721 | 24.3% |
| Other Punctuation | 10750 | 3.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 27232 | |
| W | 9690 | 11.0% |
| O | 9358 | 10.7% |
| Q | 7270 | 8.3% |
| N | 6092 | 6.9% |
| F | 4770 | 5.4% |
| J | 4063 | 4.6% |
| H | 3581 | 4.1% |
| K | 3129 | 3.6% |
| P | 2661 | 3.0% |
| Other values (11) | 9875 | 11.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 34014 | |
| 3 | 32561 | |
| 4 | 30927 | |
| 9 | 27656 | |
| 2 | 27065 | |
| 7 | 23931 | |
| 0 | 23180 | |
| 5 | 21355 | |
| 1 | 21111 | |
| 6 | 20082 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 10748 | |
| / | 1 | < 0.1% |
| . | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 272632 | |
| Latin | 87721 | 24.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 27232 | |
| W | 9690 | 11.0% |
| O | 9358 | 10.7% |
| Q | 7270 | 8.3% |
| N | 6092 | 6.9% |
| F | 4770 | 5.4% |
| J | 4063 | 4.6% |
| H | 3581 | 4.1% |
| K | 3129 | 3.6% |
| P | 2661 | 3.0% |
| Other values (11) | 9875 | 11.3% |
Common
| Value | Count | Frequency (%) |
| 8 | 34014 | |
| 3 | 32561 | |
| 4 | 30927 | |
| 9 | 27656 | |
| 2 | 27065 | |
| 7 | 23931 | |
| 0 | 23180 | |
| 5 | 21355 | |
| 1 | 21111 | |
| 6 | 20082 | |
| Other values (3) | 10750 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 360353 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 34014 | 9.4% |
| 3 | 32561 | 9.0% |
| 4 | 30927 | 8.6% |
| 9 | 27656 | 7.7% |
| M | 27232 | 7.6% |
| 2 | 27065 | 7.5% |
| 7 | 23931 | 6.6% |
| 0 | 23180 | 6.4% |
| 5 | 21355 | 5.9% |
| 1 | 21111 | 5.9% |
| Other values (24) | 91321 |
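The sample rows for this variable hold one or more accession-style identifiers joined by semicolons, and the frequency table lists the individual tokens in lowercase. A minimal sketch of how such cells could be split and counted, assuming `;` is the only separator that matters (the helper name `split_accessions` is hypothetical, and this is not necessarily how the profiler tokenizes):

```python
from collections import Counter


def split_accessions(value, sep=";"):
    """Split one delimited cell into trimmed, non-empty tokens."""
    return [tok.strip() for tok in value.split(sep) if tok.strip()]


# The five sample rows shown in the report above.
samples = [
    "MW204230;MW124559",
    "MW982336",
    "MF785606;MF785913",
    "MN344605",
    "JQ840329",
]

# Lowercase the tokens so they line up with the report's frequency table.
counts = Counter(
    tok.lower() for cell in samples for tok in split_accessions(cell)
)

for token, n in counts.most_common():
    print(token, n)
```

Counting per token rather than per cell is what makes a row such as `MW204230;MW124559` contribute two entries.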
Missing 
| Distinct | 28684 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 193547 |
| Missing (%) | 57.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 282633 |
|---|---|
| Median length | 61 |
| Mean length | 83.85489841 |
| Min length | 1 |
Unique
| Unique | 19947 |
|---|---|
| Unique (%) | 13.8% |
Sample
| 1st row | One leg removed for genetic sampling while on loan to GUELPH. |
|---|---|
| 2nd row | Order: 10948; Box Number: MBARI_0136: Box Position: B/4 |
| 3rd row | One leg removed for genetic sampling while on loan to GUELPH. |
| 4th row | Originally cataloged as an image record because field notes indicated there was a photovoucher for the specimen. When the images were cataloged in early 2020, no photos were found for this specimen so the record was changed to a Genetic Sample (DNA) with no voucher. |
| 5th row | Entire tissue sample consumed for DNA extraction. Specimen voucher located at Museum National d'Histoire Naturelle, Paris. |
| Value | Count | Frequency (%) |
| for | 114846 | 5.9% |
| on | 113429 | 5.8% |
| to | 111972 | 5.7% |
| genetic | 110770 | 5.7% |
| while | 109786 | 5.6% |
| sampling | 108913 | 5.6% |
| loan | 108870 | 5.6% |
| removed | 108857 | 5.6% |
| guelph | 108797 | 5.6% |
| one | 105620 | 5.4% |
| Other values (46309) | 846419 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1744442 | 14.4% |
| e | 1113548 | 9.2% |
| o | 806325 | 6.7% |
| n | 721471 | 6.0% |
| l | 597748 | 4.9% |
| i | 588547 | 4.9% |
| a | 470216 | 3.9% |
| t | 429603 | 3.5% |
| r | 425976 | 3.5% |
| g | 361714 | 3.0% |
| Other values (120) | 4861384 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7502487 | |
| Space Separator | 1744442 | 14.4% |
| Uppercase Letter | 1548802 | 12.8% |
| Decimal Number | 633553 | 5.2% |
| Other Punctuation | 420227 | 3.5% |
| Control | 177159 | 1.5% |
| Dash Punctuation | 41131 | 0.3% |
| Connector Punctuation | 24577 | 0.2% |
| Math Symbol | 18238 | 0.2% |
| Open Punctuation | 5170 | < 0.1% |
| Other values (4) | 5188 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1113548 | |
| o | 806325 | |
| n | 721471 | |
| l | 597748 | 8.0% |
| i | 588547 | 7.8% |
| a | 470216 | 6.3% |
| t | 429603 | 5.7% |
| r | 425976 | 5.7% |
| g | 361714 | 4.8% |
| m | 297830 | 4.0% |
| Other values (44) | 1689509 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 148874 | |
| O | 145398 | 9.4% |
| G | 142014 | 9.2% |
| E | 136146 | 8.8% |
| U | 129096 | 8.3% |
| H | 128358 | 8.3% |
| L | 121543 | 7.8% |
| N | 77084 | 5.0% |
| B | 75895 | 4.9% |
| M | 72763 | 4.7% |
| Other values (20) | 371631 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 182353 | |
| : | 91346 | |
| ; | 73034 | |
| , | 40116 | 9.5% |
| / | 22748 | 5.4% |
| ' | 3166 | 0.8% |
| " | 3111 | 0.7% |
| # | 2404 | 0.6% |
| & | 1284 | 0.3% |
| ? | 550 | 0.1% |
| Other values (4) | 115 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 116594 | |
| 0 | 94063 | |
| 2 | 75391 | |
| 9 | 57354 | |
| 3 | 52443 | |
| 4 | 50558 | |
| 8 | 47354 | |
| 7 | 47090 | |
| 5 | 46487 | 7.3% |
| 6 | 46219 | 7.3% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 17759 | |
| = | 377 | 2.1% |
| + | 59 | 0.3% |
| ~ | 17 | 0.1% |
| < | 16 | 0.1% |
| > | 10 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4350 | |
| ] | 809 | 15.7% |
| } | 6 | 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 12 | |
| ♀ | 5 | |
| ♂ | 4 | 19.0% |
Control
| Value | Count | Frequency (%) |
| (control character) | 176366 | |
| (control character) | 793 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 40888 | |
| — | 243 | 0.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4358 | |
| [ | 812 | 15.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1744442 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 24577 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9051277 | |
| Common | 3069697 | 25.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1113548 | 12.3% |
| o | 806325 | 8.9% |
| n | 721471 | 8.0% |
| l | 597748 | 6.6% |
| i | 588547 | 6.5% |
| a | 470216 | 5.2% |
| t | 429603 | 4.7% |
| r | 425976 | 4.7% |
| g | 361714 | 4.0% |
| m | 297830 | 3.3% |
| Other values (73) | 3238299 |
Common
| Value | Count | Frequency (%) |
| (space) | 1744442 | |
| . | 182353 | 5.9% |
| (control character) | 176366 | 5.7% |
| 1 | 116594 | 3.8% |
| 0 | 94063 | 3.1% |
| : | 91346 | 3.0% |
| 2 | 75391 | 2.5% |
| ; | 73034 | 2.4% |
| 9 | 57354 | 1.9% |
| 3 | 52443 | 1.7% |
| Other values (37) | 406311 | 13.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12120332 | |
| None | 386 | < 0.1% |
| Punctuation | 243 | < 0.1% |
| Misc Symbols | 9 | < 0.1% |
| Latin Ext Additional | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1744442 | 14.4% |
| e | 1113548 | 9.2% |
| o | 806325 | 6.7% |
| n | 721471 | 6.0% |
| l | 597748 | 4.9% |
| i | 588547 | 4.9% |
| a | 470216 | 3.9% |
| t | 429603 | 3.5% |
| r | 425976 | 3.5% |
| g | 361714 | 3.0% |
| Other values (81) | 4860742 |
Punctuation
| Value | Count | Frequency (%) |
| — | 243 |
None
| Value | Count | Frequency (%) |
| é | 109 | |
| í | 41 | 10.6% |
| ü | 36 | 9.3% |
| ã | 29 | 7.5% |
| Î | 27 | 7.0% |
| ó | 19 | 4.9% |
| á | 16 | 4.1% |
| è | 15 | 3.9% |
| µ | 12 | 3.1% |
| ° | 12 | 3.1% |
| Other values (22) | 70 |
Misc Symbols
| Value | Count | Frequency (%) |
| ♀ | 5 | |
| ♂ | 4 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ồ | 1 | |
| ả | 1 | |
| ắ | 1 | |
| ọ | 1 |
organismName
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | EML |
|---|
| Value | Count | Frequency (%) |
| eml | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
organismScope
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-12-01T12:07:33.811Z |
|---|
| Value | Count | Frequency (%) |
| 2024-12-01t12:07:33.811z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 4 | |
| 0 | 3 | |
| - | 2 | 8.3% |
| : | 2 | 8.3% |
| 3 | 2 | 8.3% |
| 4 | 1 | 4.2% |
| T | 1 | 4.2% |
| 7 | 1 | 4.2% |
| . | 1 | 4.2% |
| Other values (2) | 2 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17 | |
| Other Punctuation | 3 | 12.5% |
| Dash Punctuation | 2 | 8.3% |
| Uppercase Letter | 2 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 4 | |
| 0 | 3 | |
| 3 | 2 | 11.8% |
| 4 | 1 | 5.9% |
| 7 | 1 | 5.9% |
| 8 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22 | |
| Latin | 2 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 4 | |
| 0 | 3 | |
| - | 2 | 9.1% |
| : | 2 | 9.1% |
| 3 | 2 | 9.1% |
| 4 | 1 | 4.5% |
| 7 | 1 | 4.5% |
| . | 1 | 4.5% |
| 8 | 1 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 4 | |
| 0 | 3 | |
| - | 2 | 8.3% |
| : | 2 | 8.3% |
| 3 | 2 | 8.3% |
| 4 | 1 | 4.2% |
| T | 1 | 4.2% |
| 7 | 1 | 4.2% |
| . | 1 | 4.2% |
| Other values (2) | 2 | 8.3% |
associatedOrganisms
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-12-01T11:07:21.711Z |
|---|
| Value | Count | Frequency (%) |
| 2024-12-01t11:07:21.711z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 2 | 4 | |
| 0 | 3 | |
| - | 2 | 8.3% |
| : | 2 | 8.3% |
| 7 | 2 | 8.3% |
| 4 | 1 | 4.2% |
| T | 1 | 4.2% |
| . | 1 | 4.2% |
| Z | 1 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17 | |
| Other Punctuation | 3 | 12.5% |
| Dash Punctuation | 2 | 8.3% |
| Uppercase Letter | 2 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 2 | 4 | |
| 0 | 3 | |
| 7 | 2 | 11.8% |
| 4 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22 | |
| Latin | 2 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 2 | 4 | |
| 0 | 3 | |
| - | 2 | 9.1% |
| : | 2 | 9.1% |
| 7 | 2 | 9.1% |
| 4 | 1 | 4.5% |
| . | 1 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 2 | 4 | |
| 0 | 3 | |
| - | 2 | 8.3% |
| : | 2 | 8.3% |
| 7 | 2 | 8.3% |
| 4 | 1 | 4.2% |
| T | 1 | 4.2% |
| . | 1 | 4.2% |
| Z | 1 | 4.2% |
previousIdentifications
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | true |
|---|
| Value | Count | Frequency (%) |
| true | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
materialEntityRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | false |
|---|
| Value | Count | Frequency (%) |
| false | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
verbatimLabel
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338089 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 8 |
| Min length | 6 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 10.6925 |
|---|---|
| 2nd row | 5.55461 |
| 3rd row | LATIN_AMERICA |
| 4th row | 7.1633 |
| 5th row | 5.80961 |
| Value | Count | Frequency (%) |
| 10.6925 | 1 | |
| 5.55461 | 1 | |
| latin_america | 1 | |
| 7.1633 | 1 | |
| 5.80961 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | 10.0% |
| . | 4 | 10.0% |
| 6 | 4 | 10.0% |
| A | 3 | 7.5% |
| 9 | 2 | 5.0% |
| 3 | 2 | 5.0% |
| 0 | 2 | 5.0% |
| I | 2 | 5.0% |
| M | 1 | 2.5% |
| Other values (11) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23 | |
| Uppercase Letter | 12 | |
| Other Punctuation | 4 | 10.0% |
| Connector Punctuation | 1 | 2.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | |
| 6 | 4 | |
| 9 | 2 | 8.7% |
| 3 | 2 | 8.7% |
| 0 | 2 | 8.7% |
| 7 | 1 | 4.3% |
| 4 | 1 | 4.3% |
| 2 | 1 | 4.3% |
| 8 | 1 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3 | |
| I | 2 | |
| M | 1 | 8.3% |
| C | 1 | 8.3% |
| R | 1 | 8.3% |
| E | 1 | 8.3% |
| T | 1 | 8.3% |
| N | 1 | 8.3% |
| L | 1 | 8.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28 | |
| Latin | 12 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | |
| . | 4 | |
| 6 | 4 | |
| 9 | 2 | 7.1% |
| 3 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 7 | 1 | 3.6% |
| _ | 1 | 3.6% |
| 4 | 1 | 3.6% |
| Other values (2) | 2 | 7.1% |
Latin
| Value | Count | Frequency (%) |
| A | 3 | |
| I | 2 | |
| M | 1 | 8.3% |
| C | 1 | 8.3% |
| R | 1 | 8.3% |
| E | 1 | 8.3% |
| T | 1 | 8.3% |
| N | 1 | 8.3% |
| L | 1 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | 10.0% |
| . | 4 | 10.0% |
| 6 | 4 | 10.0% |
| A | 3 | 7.5% |
| 9 | 2 | 5.0% |
| 3 | 2 | 5.0% |
| 0 | 2 | 5.0% |
| I | 2 | 5.0% |
| M | 1 | 2.5% |
| Other values (11) | 11 |
materialSampleID
Text
Missing 
| Distinct | 253108 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 84986 |
| Missing (%) | 25.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.000031607 |
| Min length | 7 |
Unique
| Unique | 253108 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | AR5TC43 |
|---|---|
| 2nd row | AL2IC84 |
| 3rd row | AF9HI08 |
| 4th row | AD5JZ99 |
| 5th row | AE0OQ35 |
| Value | Count | Frequency (%) |
| ar5tc43 | 1 | < 0.1% |
| ae3rz90 | 1 | < 0.1% |
| am1rc30 | 1 | < 0.1% |
| al7ng44 | 1 | < 0.1% |
| an9jb30 | 1 | < 0.1% |
| af9hi08 | 1 | < 0.1% |
| ad5jz99 | 1 | < 0.1% |
| ae0oq35 | 1 | < 0.1% |
| an7hd65 | 1 | < 0.1% |
| ak3zy87 | 1 | < 0.1% |
| Other values (253098) | 253098 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 287625 | 16.2% |
| 7 | 79818 | 4.5% |
| 1 | 77867 | 4.4% |
| 2 | 77272 | 4.4% |
| 0 | 77025 | 4.3% |
| 4 | 76603 | 4.3% |
| 5 | 76332 | 4.3% |
| 3 | 76014 | 4.3% |
| 9 | 74989 | 4.2% |
| 6 | 73630 | 4.2% |
| Other values (29) | 794589 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1012424 | |
| Decimal Number | 759333 | |
| Other Punctuation | 4 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 287625 | |
| O | 39928 | 3.9% |
| R | 39126 | 3.9% |
| K | 38842 | 3.8% |
| E | 36151 | 3.6% |
| C | 35388 | 3.5% |
| L | 34783 | 3.4% |
| H | 34197 | 3.4% |
| I | 33715 | 3.3% |
| F | 33659 | 3.3% |
| Other values (16) | 399010 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 79818 | |
| 1 | 77867 | |
| 2 | 77272 | |
| 0 | 77025 | |
| 4 | 76603 | |
| 5 | 76332 | |
| 3 | 76014 | |
| 9 | 74989 | |
| 6 | 73630 | |
| 8 | 69783 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1012424 | |
| Common | 759340 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 287625 | |
| O | 39928 | 3.9% |
| R | 39126 | 3.9% |
| K | 38842 | 3.8% |
| E | 36151 | 3.6% |
| C | 35388 | 3.5% |
| L | 34783 | 3.4% |
| H | 34197 | 3.4% |
| I | 33715 | 3.3% |
| F | 33659 | 3.3% |
| Other values (16) | 399010 |
Common
| Value | Count | Frequency (%) |
| 7 | 79818 | |
| 1 | 77867 | |
| 2 | 77272 | |
| 0 | 77025 | |
| 4 | 76603 | |
| 5 | 76332 | |
| 3 | 76014 | |
| 9 | 74989 | |
| 6 | 73630 | |
| 8 | 69783 | |
| Other values (3) | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1771764 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 287625 | 16.2% |
| 7 | 79818 | 4.5% |
| 1 | 77867 | 4.4% |
| 2 | 77272 | 4.4% |
| 0 | 77025 | 4.3% |
| 4 | 76603 | 4.3% |
| 5 | 76332 | 4.3% |
| 3 | 76014 | 4.3% |
| 9 | 74989 | 4.2% |
| 6 | 73630 | 4.2% |
| Other values (29) | 794589 |
eventID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338092 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 94648.0 |
|---|---|
| 2nd row | PAN |
| Value | Count | Frequency (%) |
| 94648.0 | 1 | |
| pan | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| . | 1 | |
| 0 | 1 | |
| P | 1 | |
| A | 1 | |
| N | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Uppercase Letter | 3 | |
| Other Punctuation | 1 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| 0 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 | |
| Latin | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| . | 1 | |
| 0 | 1 |
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| . | 1 | |
| 0 | 1 | |
| P | 1 | |
| A | 1 | |
| N | 1 |
parentEventID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Panama |
|---|
| Value | Count | Frequency (%) |
| panama | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| P | 1 | 16.7% |
| n | 1 | 16.7% |
| m | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 | |
| Uppercase Letter | 1 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 1 | 20.0% |
| m | 1 | 20.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| P | 1 | 16.7% |
| n | 1 | 16.7% |
| m | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| P | 1 | 16.7% |
| n | 1 | 16.7% |
| m | 1 | 16.7% |
eventType
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | PAN.5_1 |
|---|
| Value | Count | Frequency (%) |
| pan.5_1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 | |
| . | 1 | |
| 5 | 1 | |
| _ | 1 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3 | |
| Decimal Number | 2 | |
| Other Punctuation | 1 | 14.3% |
| Connector Punctuation | 1 | 14.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 1 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 | |
| Latin | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 1 | |
| 5 | 1 | |
| _ | 1 | |
| 1 | 1 |
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 | |
| . | 1 | |
| 5 | 1 | |
| _ | 1 | |
| 1 | 1 |
fieldNumber
Text
Missing 
| Distinct | 7065 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 267153 |
| Missing (%) | 79.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 43 |
| Mean length | 11.55329076 |
| Min length | 1 |
Unique
| Unique | 2664 |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | MBARI/T548 |
|---|---|
| 2nd row | MBIO/BIZ-231 |
| 3rd row | Moorea F-06-12 |
| 4th row | MBARI/T488 |
| 5th row | AL-4097 |
| Value | Count | Frequency (%) |
| cb | 3399 | 3.7% |
| moorea | 3150 | 3.5% |
| fp | 1215 | 1.3% |
| lrp | 1032 | 1.1% |
| bah | 989 | 1.1% |
| tob | 834 | 0.9% |
| cur | 810 | 0.9% |
| mbio/080611_minv_014 | 626 | 0.7% |
| dgs | 506 | 0.6% |
| sec18-07 | 504 | 0.6% |
| Other values (7236) | 78011 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 78614 | 9.6% |
| - | 70093 | 8.6% |
| 1 | 62300 | 7.6% |
| B | 45896 | 5.6% |
| 2 | 43979 | 5.4% |
| I | 35433 | 4.3% |
| M | 34390 | 4.2% |
| A | 34133 | 4.2% |
| 3 | 27058 | 3.3% |
| 8 | 21657 | 2.6% |
| Other values (63) | 366049 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 325819 | |
| Decimal Number | 325257 | |
| Dash Punctuation | 70093 | 8.6% |
| Lowercase Letter | 34318 | 4.2% |
| Other Punctuation | 26173 | 3.2% |
| Space Separator | 20135 | 2.5% |
| Connector Punctuation | 17766 | 2.2% |
| Math Symbol | 37 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 45896 | |
| I | 35433 | |
| M | 34390 | |
| A | 34133 | |
| R | 18890 | 5.8% |
| S | 18631 | 5.7% |
| O | 17074 | 5.2% |
| C | 16693 | 5.1% |
| L | 16123 | 4.9% |
| U | 13273 | 4.1% |
| Other values (16) | 75283 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7972 | |
| e | 4714 | |
| r | 3982 | |
| a | 3736 | |
| n | 2353 | 6.9% |
| i | 1842 | 5.4% |
| m | 1834 | 5.3% |
| t | 1780 | 5.2% |
| v | 1564 | 4.6% |
| l | 1062 | 3.1% |
| Other values (15) | 3479 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 78614 | |
| 1 | 62300 | |
| 2 | 43979 | |
| 3 | 27058 | 8.3% |
| 8 | 21657 | 6.7% |
| 6 | 20171 | 6.2% |
| 4 | 19688 | 6.1% |
| 7 | 19207 | 5.9% |
| 5 | 17520 | 5.4% |
| 9 | 15063 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 21508 | |
| ; | 4624 | 17.7% |
| . | 14 | 0.1% |
| # | 14 | 0.1% |
| : | 12 | < 0.1% |
| , | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 70093 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 20135 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 17766 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 37 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 459465 | |
| Latin | 360137 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 45896 | |
| I | 35433 | 9.8% |
| M | 34390 | 9.5% |
| A | 34133 | 9.5% |
| R | 18890 | 5.2% |
| S | 18631 | 5.2% |
| O | 17074 | 4.7% |
| C | 16693 | 4.6% |
| L | 16123 | 4.5% |
| U | 13273 | 3.7% |
| Other values (41) | 109601 |
Common
| Value | Count | Frequency (%) |
| 0 | 78614 | |
| - | 70093 | |
| 1 | 62300 | |
| 2 | 43979 | |
| 3 | 27058 | 5.9% |
| 8 | 21657 | 4.7% |
| / | 21508 | 4.7% |
| 6 | 20171 | 4.4% |
| (space) | 20135 | 4.4% |
| 4 | 19688 | 4.3% |
| Other values (12) | 74262 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 819601 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 78614 | 9.6% |
| - | 70093 | 8.6% |
| 1 | 62300 | 7.6% |
| B | 45896 | 5.6% |
| 2 | 43979 | 5.4% |
| I | 35433 | 4.3% |
| M | 34390 | 4.2% |
| A | 34133 | 4.2% |
| 3 | 27058 | 3.3% |
| 8 | 21657 | 2.6% |
| Other values (62) | 366048 |
None
| Value | Count | Frequency (%) |
| é | 1 |
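The summary percentages for fieldNumber can be reproduced from the raw counts. The sketch below assumes that Missing (%) is taken over all 338094 observations while Distinct (%) and Unique (%) are taken over the non-missing values only; that reading is consistent with the figures reported above, but it is an inference from the numbers rather than a statement about the profiler's internals. The helper `pct` is hypothetical.

```python
def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to one decimal."""
    return round(100.0 * part / whole, 1)


n_rows = 338094      # Number of observations (dataset statistics)
n_missing = 267153   # fieldNumber: Missing
n_distinct = 7065    # fieldNumber: Distinct
n_unique = 2664      # fieldNumber: Unique (values that occur exactly once)

n_present = n_rows - n_missing  # non-missing values

missing_pct = pct(n_missing, n_rows)
distinct_pct = pct(n_distinct, n_present)
unique_pct = pct(n_unique, n_present)

print(n_present, missing_pct, distinct_pct, unique_pct)
```

Under these assumptions the three percentages come out as 79.0, 10.0 and 3.8, matching the table for this variable.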
eventDate
Text
Missing 
| Distinct | 23011 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 16903 |
| Missing (%) | 5.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 11.0585446 |
| Min length | 4 |
Unique
| Unique | 1361 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 1977-05-21 |
|---|---|
| 2nd row | 2003-04-05 |
| 3rd row | 2009-12-05 |
| 4th row | 2006-09-14 |
| 5th row | 2003-05-01/2003-05-13 |
| Value | Count | Frequency (%) |
| 2018-03-19/2018-03-23 | 1119 | 0.3% |
| 2016-02-22/2016-03-09 | 840 | 0.3% |
| 2008-06-11 | 649 | 0.2% |
| 2017-05-26 | 623 | 0.2% |
| 2015-05-09 | 524 | 0.2% |
| 2017-05-23 | 518 | 0.2% |
| 2017-05-30 | 515 | 0.2% |
| 2006-03-12 | 513 | 0.2% |
| 2017-08-14 | 508 | 0.2% |
| 2017-05-27 | 505 | 0.2% |
| Other values (23001) | 314877 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 767941 | |
| - | 702212 | |
| 1 | 575030 | |
| 2 | 411303 | |
| 9 | 302111 | 8.5% |
| 8 | 151019 | 4.3% |
| 7 | 140049 | 3.9% |
| 6 | 130946 | 3.7% |
| 5 | 122995 | 3.5% |
| 3 | 119921 | 3.4% |
| Other values (7) | 128378 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2817587 | |
| Dash Punctuation | 702212 | 19.8% |
| Other Punctuation | 32102 | 0.9% |
| Uppercase Letter | 3 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 767941 | |
| 1 | 575030 | |
| 2 | 411303 | |
| 9 | 302111 | 10.7% |
| 8 | 151019 | 5.4% |
| 7 | 140049 | 5.0% |
| 6 | 130946 | 4.6% |
| 5 | 122995 | 4.4% |
| 3 | 119921 | 4.3% |
| 4 | 96272 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 32100 | |
| . | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 702212 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3551902 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 767941 | |
| - | 702212 | |
| 1 | 575030 | |
| 2 | 411303 | |
| 9 | 302111 | 8.5% |
| 8 | 151019 | 4.3% |
| 7 | 140049 | 3.9% |
| 6 | 130946 | 3.7% |
| 5 | 122995 | 3.5% |
| 3 | 119921 | 3.4% |
| Other values (4) | 128375 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3551905 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 767941 | |
| - | 702212 | |
| 1 | 575030 | |
| 2 | 411303 | |
| 9 | 302111 | 8.5% |
| 8 | 151019 | 4.3% |
| 7 | 140049 | 3.9% |
| 6 | 130946 | 3.7% |
| 5 | 122995 | 3.5% |
| 3 | 119921 | 3.4% |
| Other values (7) | 128378 | 3.6% |
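The eventDate samples are either a single ISO 8601 date or a start/end interval separated by `/`, and the character tables show that a few stray letters and underscores also occur in the column. A minimal sketch of a parser that accepts only the two full-date forms and returns `None` for anything else follows; the function names are hypothetical, and partial dates (the minimum length of 4 suggests year-only values exist) are deliberately left unparsed here.

```python
import re
from datetime import date

_DATE = r"\d{4}-\d{2}-\d{2}"
_PATTERN = re.compile(rf"^({_DATE})(?:/({_DATE}))?$")


def parse_event_date(value):
    """Return (start, end) dates, or None if value is not a full date or interval."""
    match = _PATTERN.match(value.strip())
    if match is None:
        return None
    try:
        start = date.fromisoformat(match.group(1))
        end = date.fromisoformat(match.group(2)) if match.group(2) else start
    except ValueError:
        # Right shape but not a real calendar date, e.g. month 13.
        return None
    return start, end


def start_day_of_year(value):
    """Day of year (1-366) of the start date, or None if unparseable."""
    parsed = parse_event_date(value)
    return None if parsed is None else parsed[0].timetuple().tm_yday


for raw in ["1977-05-21", "2003-04-05", "2003-05-01/2003-05-13", "PAN"]:
    print(raw, start_day_of_year(raw))
```

For the five eventDate sample rows this yields 141, 95, 339, 257 and 121, which are the same values shown as samples for startDayOfYear.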
eventTime
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338093 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Pinogana |
|---|
| Value | Count | Frequency (%) |
| pinogana | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| P | 1 | |
| i | 1 | |
| o | 1 | |
| g | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| i | 1 | |
| o | 1 | |
| g | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| P | 1 | |
| i | 1 | |
| o | 1 | |
| g | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| P | 1 | |
| i | 1 | |
| o | 1 | |
| g | 1 |
startDayOfYear
Text
Missing 
| Distinct | 367 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 19910 |
| Missing (%) | 5.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 3 |
| Mean length | 2.770698715 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 141 |
|---|---|
| 2nd row | 95 |
| 3rd row | 339 |
| 4th row | 257 |
| 5th row | 121 |
| Value | Count | Frequency (%) |
| 142 | 2410 | 0.8% |
| 78 | 1961 | 0.6% |
| 140 | 1910 | 0.6% |
| 147 | 1848 | 0.6% |
| 201 | 1848 | 0.6% |
| 182 | 1823 | 0.6% |
| 152 | 1822 | 0.6% |
| 197 | 1811 | 0.6% |
| 150 | 1809 | 0.6% |
| 146 | 1797 | 0.6% |
| Other values (357) | 299145 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 192507 | |
| 2 | 156032 | |
| 3 | 99733 | |
| 4 | 66047 | 7.5% |
| 5 | 65028 | 7.4% |
| 7 | 63233 | 7.2% |
| 6 | 60765 | 6.9% |
| 0 | 60705 | 6.9% |
| 8 | 59613 | 6.8% |
| 9 | 57922 | 6.6% |
| Other values (5) | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 881585 | |
| Other Punctuation | 3 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 192507 | |
| 2 | 156032 | |
| 3 | 99733 | |
| 4 | 66047 | 7.5% |
| 5 | 65028 | 7.4% |
| 7 | 63233 | 7.2% |
| 6 | 60765 | 6.9% |
| 0 | 60705 | 6.9% |
| 8 | 59613 | 6.8% |
| 9 | 57922 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 881589 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 192507 | |
| 2 | 156032 | |
| 3 | 99733 | |
| 4 | 66047 | 7.5% |
| 5 | 65028 | 7.4% |
| 7 | 63233 | 7.2% |
| 6 | 60765 | 6.9% |
| 0 | 60705 | 6.9% |
| 8 | 59613 | 6.8% |
| 9 | 57922 | 6.6% |
| Other values (2) | 4 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| A | 1 | |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 881592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 192507 | |
| 2 | 156032 | |
| 3 | 99733 | |
| 4 | 66047 | 7.5% |
| 5 | 65028 | 7.4% |
| 7 | 63233 | 7.2% |
| 6 | 60765 | 6.9% |
| 0 | 60705 | 6.9% |
| 8 | 59613 | 6.8% |
| 9 | 57922 | 6.6% |
| Other values (5) | 7 | < 0.1% |
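Several rows in the character tables above have a blank Frequency (%) cell. A minimal sketch, assuming each share is simply the row count divided by the column's total character count (881592, the ASCII block total reported for startDayOfYear), shows how those values can be recovered. The helper name `share` and the one-decimal rounding rule are assumptions for illustration, not part of the report.

```python
def share(count: int, total: int) -> str:
    """Format count/total as a percentage with one decimal place.

    Assumption: non-zero shares below 0.1% are shown as '< 0.1%',
    mirroring the convention visible in the report tables.
    """
    pct = 100.0 * count / total
    if 0 < pct < 0.1:
        return "< 0.1%"
    return f"{pct:.1f}%"


# Total characters in startDayOfYear (ASCII block row of the report).
total_chars = 881592

# Counts copied from the "Most occurring characters" table.
counts = {"1": 192507, "2": 156032, "3": 99733, "4": 66047}

shares = {char: share(n, total_chars) for char, n in counts.items()}
print(shares)
```

As a consistency check, `share(66047, total_chars)` gives `"7.5%"`, which matches the value the report prints for the character `4`.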
endDayOfYear
Text
Missing 
| Distinct | 367 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 19910 |
| Missing (%) | 5.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 3 |
| Mean length | 2.778879516 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 141 |
|---|---|
| 2nd row | 95 |
| 3rd row | 339 |
| 4th row | 257 |
| 5th row | 133 |
| Value | Count | Frequency (%) |
| 142 | 2344 | 0.7% |
| 151 | 2030 | 0.6% |
| 150 | 2012 | 0.6% |
| 82 | 1896 | 0.6% |
| 69 | 1863 | 0.6% |
| 143 | 1854 | 0.6% |
| 212 | 1815 | 0.6% |
| 197 | 1800 | 0.6% |
| 146 | 1790 | 0.6% |
| 147 | 1756 | 0.6% |
| Other values (359) | 299026 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 188897 | |
| 2 | 158630 | |
| 3 | 100399 | |
| 4 | 66204 | 7.5% |
| 5 | 64939 | 7.3% |
| 0 | 62735 | 7.1% |
| 6 | 61927 | 7.0% |
| 7 | 61685 | 7.0% |
| 8 | 59642 | 6.7% |
| 9 | 59125 | 6.7% |
| Other values (11) | 12 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 884183 | |
| Lowercase Letter | 8 | < 0.1% |
| Space Separator | 2 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 188897 | |
| 2 | 158630 | |
| 3 | 100399 | |
| 4 | 66204 | 7.5% |
| 5 | 64939 | 7.3% |
| 0 | 62735 | 7.1% |
| 6 | 61927 | 7.0% |
| 7 | 61685 | 7.0% |
| 8 | 59642 | 6.7% |
| 9 | 59125 | 6.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1 | |
| p | 1 | |
| u | 1 | |
| d | 1 | |
| a | 1 | |
| c | 1 | |
| o | 1 | |
| é | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| B | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 884185 | |
| Latin | 10 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 188897 | |
| 2 | 158630 | |
| 3 | 100399 | |
| 4 | 66204 | 7.5% |
| 5 | 64939 | 7.3% |
| 0 | 62735 | 7.1% |
| 6 | 61927 | 7.0% |
| 7 | 61685 | 7.0% |
| 8 | 59642 | 6.7% |
| 9 | 59125 | 6.7% |
Latin
| Value | Count | Frequency (%) |
| e | 1 | |
| p | 1 | |
| u | 1 | |
| C | 1 | |
| B | 1 | |
| d | 1 | |
| a | 1 | |
| c | 1 | |
| o | 1 | |
| é | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 884194 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 188897 | |
| 2 | 158630 | |
| 3 | 100399 | |
| 4 | 66204 | 7.5% |
| 5 | 64939 | 7.3% |
| 0 | 62735 | 7.1% |
| 6 | 61927 | 7.0% |
| 7 | 61685 | 7.0% |
| 8 | 59642 | 6.7% |
| 9 | 59125 | 6.7% |
| Other values (10) | 11 | < 0.1% |
None
| Value | Count | Frequency (%) |
| é | 1 |
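The startDayOfYear and endDayOfYear profiles both report 367 distinct values and a handful of letters and punctuation marks, which suggests that a few entries are not plain day-of-year integers. A minimal sketch of how such entries could be flagged follows, assuming the intended domain is an integer from 1 to 366. The function name and the non-numeric sample strings are hypothetical and are not taken from the dataset.

```python
import re

# One to three ASCII digits; anything else is treated as malformed.
DAY_OF_YEAR = re.compile(r"[0-9]{1,3}")


def is_valid_day_of_year(value: str) -> bool:
    """Return True when value is an integer string in the range 1..366."""
    if not DAY_OF_YEAR.fullmatch(value):
        return False
    return 1 <= int(value) <= 366


# The first three values appear as sample rows in the report;
# the remaining ones are made-up examples of malformed input.
samples = ["141", "95", "339", "n/a", "0", "367", "12 Feb"]

invalid = [v for v in samples if not is_valid_day_of_year(v)]
print(invalid)
```

Applied to the full column, a filter like this would isolate the rows that account for the stray letters and punctuation counted in the category tables.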
year
Text
Missing 
| Distinct | 157 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17140 |
| Missing (%) | 5.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.999993769 |
| Min length | 2 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1977 |
|---|---|
| 2nd row | 2003 |
| 3rd row | 2009 |
| 4th row | 2006 |
| 5th row | 2003 |
| Value | Count | Frequency (%) |
| 2009 | 14234 | 4.4% |
| 2017 | 14047 | 4.4% |
| 2015 | 13798 | 4.3% |
| 2010 | 13719 | 4.3% |
| 2012 | 12169 | 3.8% |
| 2008 | 11965 | 3.7% |
| 2016 | 11441 | 3.6% |
| 2018 | 11080 | 3.5% |
| 2019 | 9897 | 3.1% |
| 2006 | 9359 | 2.9% |
| Other values (147) | 199245 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 294854 | |
| 1 | 274326 | |
| 2 | 222206 | |
| 9 | 214084 | |
| 8 | 70429 | 5.5% |
| 7 | 60121 | 4.7% |
| 6 | 50143 | 3.9% |
| 5 | 37703 | 2.9% |
| 3 | 30502 | 2.4% |
| 4 | 29444 | 2.3% |
| Other values (2) | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1283812 | |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 294854 | |
| 1 | 274326 | |
| 2 | 222206 | |
| 9 | 214084 | |
| 8 | 70429 | 5.5% |
| 7 | 60121 | 4.7% |
| 6 | 50143 | 3.9% |
| 5 | 37703 | 2.9% |
| 3 | 30502 | 2.4% |
| 4 | 29444 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1 | |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1283812 | |
| Latin | 2 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 294854 | |
| 1 | 274326 | |
| 2 | 222206 | |
| 9 | 214084 | |
| 8 | 70429 | 5.5% |
| 7 | 60121 | 4.7% |
| 6 | 50143 | 3.9% |
| 5 | 37703 | 2.9% |
| 3 | 30502 | 2.4% |
| 4 | 29444 | 2.3% |
Latin
| Value | Count | Frequency (%) |
| L | 1 | |
| C | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1283814 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 294854 | |
| 1 | 274326 | |
| 2 | 222206 | |
| 9 | 214084 | |
| 8 | 70429 | 5.5% |
| 7 | 60121 | 4.7% |
| 6 | 50143 | 3.9% |
| 5 | 37703 | 2.9% |
| 3 | 30502 | 2.4% |
| 4 | 29444 | 2.3% |
| Other values (2) | 2 | < 0.1% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 22792 |
| Missing (%) | 6.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.179186938 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 5 |
|---|---|
| 2nd row | 4 |
| 3rd row | 12 |
| 4th row | 9 |
| 5th row | 5 |
| Value | Count | Frequency (%) |
| 5 | 42126 | |
| 6 | 36817 | |
| 7 | 36682 | |
| 8 | 30685 | |
| 4 | 28521 | |
| 3 | 27357 | |
| 9 | 25336 | |
| 10 | 23088 | |
| 11 | 20226 | |
| 2 | 15793 | 5.0% |
| Other values (2) | 28671 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 92211 | |
| 5 | 42126 | |
| 6 | 36817 | 9.9% |
| 7 | 36682 | 9.9% |
| 8 | 30685 | 8.3% |
| 2 | 28977 | 7.8% |
| 4 | 28521 | 7.7% |
| 3 | 27357 | 7.4% |
| 9 | 25336 | 6.8% |
| 0 | 23088 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 371800 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 92211 | |
| 5 | 42126 | |
| 6 | 36817 | 9.9% |
| 7 | 36682 | 9.9% |
| 8 | 30685 | 8.3% |
| 2 | 28977 | 7.8% |
| 4 | 28521 | 7.7% |
| 3 | 27357 | 7.4% |
| 9 | 25336 | 6.8% |
| 0 | 23088 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 371800 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 92211 | |
| 5 | 42126 | |
| 6 | 36817 | 9.9% |
| 7 | 36682 | 9.9% |
| 8 | 30685 | 8.3% |
| 2 | 28977 | 7.8% |
| 4 | 28521 | 7.7% |
| 3 | 27357 | 7.4% |
| 9 | 25336 | 6.8% |
| 0 | 23088 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 371800 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 92211 | |
| 5 | 42126 | |
| 6 | 36817 | 9.9% |
| 7 | 36682 | 9.9% |
| 8 | 30685 | 8.3% |
| 2 | 28977 | 7.8% |
| 4 | 28521 | 7.7% |
| 3 | 27357 | 7.4% |
| 9 | 25336 | 6.8% |
| 0 | 23088 | 6.2% |
day
Text
Missing 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52010 |
| Missing (%) | 15.4% |
| Memory size | 2.6 MiB |
Length
| Max length | 96 |
|---|---|
| Median length | 2 |
| Mean length | 1.706149243 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 21 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 14 |
| 5th row | 14 |
| Value | Count | Frequency (%) |
| 16 | 10591 | 3.7% |
| 11 | 10590 | 3.7% |
| 8 | 10195 | 3.6% |
| 10 | 10187 | 3.6% |
| 5 | 10116 | 3.5% |
| 15 | 10084 | 3.5% |
| 12 | 10050 | 3.5% |
| 14 | 9885 | 3.5% |
| 7 | 9698 | 3.4% |
| 22 | 9650 | 3.4% |
| Other values (35) | 185051 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 130494 | |
| 2 | 119138 | |
| 3 | 40129 | 8.2% |
| 8 | 29336 | 6.0% |
| 6 | 29314 | 6.0% |
| 5 | 28584 | 5.9% |
| 7 | 28048 | 5.7% |
| 0 | 28006 | 5.7% |
| 4 | 27568 | 5.6% |
| 9 | 27393 | 5.6% |
| Other values (30) | 92 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 488010 | |
| Lowercase Letter | 64 | < 0.1% |
| Space Separator | 13 | < 0.1% |
| Uppercase Letter | 9 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 7 | |
| r | 7 | |
| a | 5 | |
| c | 4 | 6.2% |
| i | 4 | 6.2% |
| t | 4 | 6.2% |
| n | 4 | 6.2% |
| s | 3 | 4.7% |
| d | 3 | 4.7% |
| Other values (9) | 12 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 130494 | |
| 2 | 119138 | |
| 3 | 40129 | 8.2% |
| 8 | 29336 | 6.0% |
| 6 | 29314 | 6.0% |
| 5 | 28584 | 5.9% |
| 7 | 28048 | 5.7% |
| 0 | 28006 | 5.7% |
| 4 | 27568 | 5.6% |
| 9 | 27393 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 3 | |
| P | 2 | |
| B | 1 | 11.1% |
| C | 1 | 11.1% |
| W | 1 | 11.1% |
| E | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 488029 | |
| Latin | 73 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 7 | 9.6% |
| r | 7 | 9.6% |
| a | 5 | 6.8% |
| c | 4 | 5.5% |
| i | 4 | 5.5% |
| t | 4 | 5.5% |
| n | 4 | 5.5% |
| s | 3 | 4.1% |
| d | 3 | 4.1% |
| Other values (15) | 21 |
Common
| Value | Count | Frequency (%) |
| 1 | 130494 | |
| 2 | 119138 | |
| 3 | 40129 | 8.2% |
| 8 | 29336 | 6.0% |
| 6 | 29314 | 6.0% |
| 5 | 28584 | 5.9% |
| 7 | 28048 | 5.7% |
| 0 | 28006 | 5.7% |
| 4 | 27568 | 5.6% |
| 9 | 27393 | 5.6% |
| Other values (5) | 19 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 488102 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 130494 | |
| 2 | 119138 | |
| 3 | 40129 | 8.2% |
| 8 | 29336 | 6.0% |
| 6 | 29314 | 6.0% |
| 5 | 28584 | 5.9% |
| 7 | 28048 | 5.7% |
| 0 | 28006 | 5.7% |
| 4 | 27568 | 5.6% |
| 9 | 27393 | 5.6% |
| Other values (30) | 92 | < 0.1% |
verbatimEventDate
Text
Missing 
| Distinct | 10231 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 235843 |
| Missing (%) | 69.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 72 |
|---|---|
| Median length | 71 |
| Mean length | 13.69981712 |
| Min length | 1 |
Unique
| Unique | 2669 |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | 4/5/2003 3:59:00 PM |
|---|---|
| 2nd row | 2007 or prior, based on filename of source data sheet |
| 3rd row | 14 Sep 2006 |
| 4th row | 10/11/2002 1:30:00 PM |
| 5th row | 11 May 2014 |
| Value | Count | Frequency (%) |
| may | 10951 | 3.6% |
| apr | 6716 | 2.2% |
| pm | 6650 | 2.2% |
| aug | 5881 | 1.9% |
|  | 5371 | 1.8% |
| 2007 | 5227 | 1.7% |
| sep | 5183 | 1.7% |
| mar | 4904 | 1.6% |
| 2008 | 4654 | 1.5% |
| june | 4026 | 1.3% |
| Other values (3776) | 242866 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 200178 | 14.3% |
| 0 | 158992 | 11.3% |
| 1 | 143302 | 10.2% |
| 2 | 117701 | 8.4% |
| 9 | 73185 | 5.2% |
| e | 40939 | 2.9% |
| 8 | 37291 | 2.7% |
| a | 35798 | 2.6% |
| 3 | 32860 | 2.3% |
| r | 32280 | 2.3% |
| Other values (66) | 528294 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 670885 | |
| Lowercase Letter | 313740 | |
| Space Separator | 200178 | 14.3% |
| Uppercase Letter | 112844 | 8.1% |
| Other Punctuation | 64360 | 4.6% |
| Dash Punctuation | 30822 | 2.2% |
| Open Punctuation | 3948 | 0.3% |
| Close Punctuation | 3948 | 0.3% |
| Math Symbol | 81 | < 0.1% |
| Connector Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 40939 | |
| a | 35798 | |
| r | 32280 | |
| u | 27456 | |
| t | 25372 | 8.1% |
| p | 19853 | 6.3% |
| n | 18252 | 5.8% |
| y | 16737 | 5.3% |
| o | 15204 | 4.8% |
| c | 13230 | 4.2% |
| Other values (15) | 68619 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 25454 | |
| J | 20868 | |
| A | 19538 | |
| S | 12302 | |
| N | 7617 | 6.8% |
| P | 6955 | 6.2% |
| D | 5024 | 4.5% |
| O | 4895 | 4.3% |
| F | 4343 | 3.8% |
| E | 1480 | 1.3% |
| Other values (11) | 4368 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 32208 | |
| / | 13150 | |
| . | 8664 | 13.5% |
| ; | 8151 | 12.7% |
| , | 2129 | 3.3% |
| ? | 16 | < 0.1% |
| * | 15 | < 0.1% |
| ' | 9 | < 0.1% |
| & | 6 | < 0.1% |
| # | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 158992 | |
| 1 | 143302 | |
| 2 | 117701 | |
| 9 | 73185 | |
| 8 | 37291 | 5.6% |
| 3 | 32860 | 4.9% |
| 5 | 31291 | 4.7% |
| 7 | 28000 | 4.2% |
| 4 | 24299 | 3.6% |
| 6 | 23964 | 3.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30541 | |
| – | 281 | 0.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 3930 | |
| ( | 18 | 0.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 3930 | |
| ) | 18 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 200178 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 81 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 974236 | |
| Latin | 426584 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 40939 | 9.6% |
| a | 35798 | 8.4% |
| r | 32280 | 7.6% |
| u | 27456 | 6.4% |
| M | 25454 | 6.0% |
| t | 25372 | 5.9% |
| J | 20868 | 4.9% |
| p | 19853 | 4.7% |
| A | 19538 | 4.6% |
| n | 18252 | 4.3% |
| Other values (36) | 160774 |
Common
| Value | Count | Frequency (%) |
| (space) | 200178 | |
| 0 | 158992 | |
| 1 | 143302 | |
| 2 | 117701 | |
| 9 | 73185 | 7.5% |
| 8 | 37291 | 3.8% |
| 3 | 32860 | 3.4% |
| : | 32208 | 3.3% |
| 5 | 31291 | 3.2% |
| - | 30541 | 3.1% |
| Other values (20) | 116687 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1400539 | |
| Punctuation | 281 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 200178 | 14.3% |
| 0 | 158992 | 11.4% |
| 1 | 143302 | 10.2% |
| 2 | 117701 | 8.4% |
| 9 | 73185 | 5.2% |
| e | 40939 | 2.9% |
| 8 | 37291 | 2.7% |
| a | 35798 | 2.6% |
| 3 | 32860 | 2.3% |
| r | 32280 | 2.3% |
| Other values (65) | 528013 |
Punctuation
| Value | Count | Frequency (%) |
| – | 281 |
habitat
Text
Missing 
| Distinct | 5074 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 302025 |
| Missing (%) | 89.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 382 |
|---|---|
| Median length | 180 |
| Mean length | 39.97022374 |
| Min length | 1 |
Unique
| Unique | 1915 |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | Rocky slope with scattered shrubs. Moist soil on slope |
|---|---|
| 2nd row | Scrubland |
| 3rd row | Ecological remarks by collector(s): yes |
| 4th row | Cultivated/garden |
| 5th row | brushed from under rubble |
| Value | Count | Frequency (%) |
| forest | 9232 | 4.6% |
| and | 8075 | 4.0% |
| with | 6431 | 3.2% |
| by | 4851 | 2.4% |
| ecological | 4348 | 2.2% |
| remarks | 4348 | 2.2% |
| collector(s | 4343 | 2.2% |
| in | 4299 | 2.1% |
| yes | 3549 | 1.8% |
| slopes | 2419 | 1.2% |
| Other values (4257) | 150032 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 165858 | 11.5% |
| e | 122882 | 8.5% |
| a | 115017 | 8.0% |
| r | 97779 | 6.8% |
| o | 97024 | 6.7% |
| s | 87680 | 6.1% |
| i | 77128 | 5.3% |
| n | 73943 | 5.1% |
| t | 69140 | 4.8% |
| l | 65307 | 4.5% |
| Other values (77) | 469928 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1158484 | |
| Space Separator | 165858 | 11.5% |
| Uppercase Letter | 57641 | 4.0% |
| Other Punctuation | 43972 | 3.1% |
| Open Punctuation | 5085 | 0.4% |
| Close Punctuation | 5081 | 0.4% |
| Decimal Number | 3126 | 0.2% |
| Dash Punctuation | 2287 | 0.2% |
| Math Symbol | 151 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 122882 | |
| a | 115017 | 9.9% |
| r | 97779 | 8.4% |
| o | 97024 | 8.4% |
| s | 87680 | 7.6% |
| i | 77128 | 6.7% |
| n | 73943 | 6.4% |
| t | 69140 | 6.0% |
| l | 65307 | 5.6% |
| c | 52978 | 4.6% |
| Other values (17) | 299606 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 6024 | 10.5% |
| C | 5575 | 9.7% |
| S | 5336 | 9.3% |
| A | 5260 | 9.1% |
| P | 4281 | 7.4% |
| M | 4222 | 7.3% |
| R | 4146 | 7.2% |
| B | 3101 | 5.4% |
| D | 2366 | 4.1% |
| G | 2064 | 3.6% |
| Other values (16) | 15266 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22792 | |
| . | 11645 | |
| : | 4679 | 10.6% |
| / | 2864 | 6.5% |
| ; | 1481 | 3.4% |
| & | 181 | 0.4% |
| % | 121 | 0.3% |
| " | 101 | 0.2% |
| ? | 72 | 0.2% |
| ' | 24 | 0.1% |
| Other values (2) | 12 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 714 | |
| 1 | 509 | |
| 2 | 394 | |
| 5 | 351 | |
| 3 | 258 | 8.3% |
| 8 | 219 | 7.0% |
| 4 | 196 | 6.3% |
| 6 | 173 | 5.5% |
| 7 | 162 | 5.2% |
| 9 | 150 | 4.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2275 | |
| — | 8 | 0.3% |
| – | 4 | 0.2% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 138 | |
| + | 8 | 5.3% |
| < | 5 | 3.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5049 | |
| [ | 36 | 0.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5045 | |
| ] | 36 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 165858 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1216125 | |
| Common | 225561 | 15.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 122882 | 10.1% |
| a | 115017 | 9.5% |
| r | 97779 | 8.0% |
| o | 97024 | 8.0% |
| s | 87680 | 7.2% |
| i | 77128 | 6.3% |
| n | 73943 | 6.1% |
| t | 69140 | 5.7% |
| l | 65307 | 5.4% |
| c | 52978 | 4.4% |
| Other values (43) | 357247 |
Common
| Value | Count | Frequency (%) |
| (space) | 165858 | |
| , | 22792 | 10.1% |
| . | 11645 | 5.2% |
| ( | 5049 | 2.2% |
| ) | 5045 | 2.2% |
| : | 4679 | 2.1% |
| / | 2864 | 1.3% |
| - | 2275 | 1.0% |
| ; | 1481 | 0.7% |
| 0 | 714 | 0.3% |
| Other values (24) | 3159 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1441668 | |
| Punctuation | 12 | < 0.1% |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 165858 | 11.5% |
| e | 122882 | 8.5% |
| a | 115017 | 8.0% |
| r | 97779 | 6.8% |
| o | 97024 | 6.7% |
| s | 87680 | 6.1% |
| i | 77128 | 5.3% |
| n | 73943 | 5.1% |
| t | 69140 | 4.8% |
| l | 65307 | 4.5% |
| Other values (74) | 469910 |
Punctuation
| Value | Count | Frequency (%) |
| — | 8 | |
| – | 4 |
None
| Value | Count | Frequency (%) |
| ñ | 6 |
locationID
Text
Missing 
| Distinct | 4570 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 284620 |
| Missing (%) | 84.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 6.812862326 |
| Min length | 1 |
Unique
| Unique | 1199 |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | T548 |
|---|---|
| 2nd row | BIZ-231 |
| 3rd row | T488 |
| 4th row | 02-10 |
| 5th row | VES117 |
| Value | Count | Frequency (%) |
| 080611_minv_014 | 627 | 1.1% |
| site | 469 | 0.8% |
| trawl | 456 | 0.8% |
| i | 456 | 0.8% |
| serc | 326 | 0.6% |
| 14 | 313 | 0.6% |
| v1951 | 308 | 0.5% |
| 080608_minv_012 | 289 | 0.5% |
| 10 | 275 | 0.5% |
| 21 | 275 | 0.5% |
| Other values (4452) | 53036 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 37140 | 10.2% |
| 1 | 34688 | 9.5% |
| - | 19187 | 5.3% |
| 2 | 18285 | 5.0% |
| I | 15952 | 4.4% |
| _ | 15373 | 4.2% |
| 5 | 13776 | 3.8% |
| 4 | 13693 | 3.8% |
| 8 | 13263 | 3.6% |
| 6 | 12669 | 3.5% |
| Other values (73) | 170285 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 176505 | |
| Uppercase Letter | 123781 | |
| Lowercase Letter | 23388 | 6.4% |
| Dash Punctuation | 19187 | 5.3% |
| Connector Punctuation | 15373 | 4.2% |
| Space Separator | 3356 | 0.9% |
| Other Punctuation | 2271 | 0.6% |
| Open Punctuation | 203 | 0.1% |
| Close Punctuation | 202 | 0.1% |
| Math Symbol | 40 | < 0.1% |
| Other values (2) | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 15952 | |
| A | 12413 | 10.0% |
| B | 12220 | 9.9% |
| S | 9794 | 7.9% |
| M | 9584 | 7.7% |
| T | 7525 | 6.1% |
| Z | 6269 | 5.1% |
| O | 5814 | 4.7% |
| N | 5745 | 4.6% |
| V | 4402 | 3.6% |
| Other values (18) | 34063 |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2390 | |
| i | 2331 | |
| e | 1964 | 8.4% |
| m | 1947 | 8.3% |
| o | 1868 | 8.0% |
| a | 1852 | 7.9% |
| t | 1743 | 7.5% |
| r | 1556 | 6.7% |
| v | 1293 | 5.5% |
| g | 941 | 4.0% |
| Other values (17) | 5503 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 37140 | |
| 1 | 34688 | |
| 2 | 18285 | |
| 5 | 13776 | 7.8% |
| 4 | 13693 | 7.8% |
| 8 | 13263 | 7.5% |
| 6 | 12669 | 7.2% |
| 3 | 12421 | 7.0% |
| 7 | 11375 | 6.4% |
| 9 | 9195 | 5.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1117 | |
| . | 1044 | |
| # | 108 | 4.8% |
| , | 2 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 196 | |
| [ | 6 | 3.0% |
| ‚ | 1 | 0.5% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 37 | |
| ¬ | 2 | 5.0% |
| + | 1 | 2.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 196 | |
| ] | 6 | 3.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 3 | |
| € | 1 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19187 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15373 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3356 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 217142 | |
| Latin | 147169 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 15952 | 10.8% |
| A | 12413 | 8.4% |
| B | 12220 | 8.3% |
| S | 9794 | 6.7% |
| M | 9584 | 6.5% |
| T | 7525 | 5.1% |
| Z | 6269 | 4.3% |
| O | 5814 | 4.0% |
| N | 5745 | 3.9% |
| V | 4402 | 3.0% |
| Other values (45) | 57451 |
Common
| Value | Count | Frequency (%) |
| 0 | 37140 | |
| 1 | 34688 | |
| - | 19187 | |
| 2 | 18285 | |
| _ | 15373 | |
| 5 | 13776 | 6.3% |
| 4 | 13693 | 6.3% |
| 8 | 13263 | 6.1% |
| 6 | 12669 | 5.8% |
| 3 | 12421 | 5.7% |
| Other values (18) | 26647 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 364293 | |
| None | 15 | < 0.1% |
| Punctuation | 2 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 37140 | 10.2% |
| 1 | 34688 | 9.5% |
| - | 19187 | 5.3% |
| 2 | 18285 | 5.0% |
| I | 15952 | 4.4% |
| _ | 15373 | 4.2% |
| 5 | 13776 | 3.8% |
| 4 | 13693 | 3.8% |
| 8 | 13263 | 3.6% |
| 6 | 12669 | 3.5% |
| Other values (62) | 170267 |
None
| Value | Count | Frequency (%) |
| Ã | 3 | |
| ¢ | 3 | |
| Â | 2 | |
| â | 2 | |
| ¬ | 2 | |
| ƒ | 1 | 6.7% |
| š | 1 | 6.7% |
| Å | 1 | 6.7% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 1 | |
| “ | 1 |
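The non-ASCII characters counted for locationID (Ã, Â, â, ¢, ¬, €, ‚ and similar) look like the usual result of UTF-8 text being decoded as Windows-1252, although the report itself does not confirm this. Under that assumption, a minimal sketch of a round-trip repair is shown below. The function name `repair_mojibake` is hypothetical, and the example strings are illustrative rather than actual locationID values.

```python
def repair_mojibake(text: str) -> str:
    """Undo one round of 'UTF-8 bytes read as Windows-1252' mis-decoding.

    Assumption: the damage came from exactly that mix-up. If the
    round trip fails, the text is returned unchanged.
    """
    try:
        return text.encode("cp1252").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


# "\u00c3\u00a9" is the two-character sequence "Ã©", which is how
# "é" (U+00E9) appears after the mis-decoding described above.
print(repair_mojibake("\u00c3\u00a9"))  # repaired to a single "é"
print(repair_mojibake("T548"))          # plain ASCII is left as-is
```

Pure ASCII identifiers pass through unchanged, and strings that were never damaged typically fail the round trip and are returned as they were, so values should still be spot-checked before any bulk replacement.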
higherGeography
Text
Missing 
| Distinct | 7779 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 4531 |
| Missing (%) | 1.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 128 |
|---|---|
| Median length | 103 |
| Mean length | 44.48305717 |
| Min length | 4 |
Unique
| Unique | 787 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | United States, Arizona, Cochise |
|---|---|
| 2nd row | North Pacific Ocean, Gulf of California, Mexico |
| 3rd row | South Pacific Ocean, French Polynesia, Society Islands, Moorea |
| 4th row | United States, Arkansas |
| 5th row | Asia-Temperate, China, Xizang, Nielamu (Nyalam) Xian |
| Value | Count | Frequency (%) |
| states | 150734 | 7.6% |
| united | 150654 | 7.6% |
| north | 101817 | 5.1% |
| ocean | 69413 | 3.5% |
| pacific | 66261 | 3.4% |
| america | 65435 | 3.3% |
| not | 60307 | 3.0% |
| stated | 60307 | 3.0% |
| islands | 44071 | 2.2% |
| atlantic | 41374 | 2.1% |
| Other values (4525) | 1167157 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1643967 | 11.1% |
| a | 1472182 | 9.9% |
| t | 1108058 | 7.5% |
| e | 1084394 | 7.3% |
| i | 1040753 | 7.0% |
| n | 860928 | 5.8% |
| , | 825692 | 5.6% |
| o | 731207 | 4.9% |
| r | 620008 | 4.2% |
| s | 541802 | 3.7% |
| Other values (88) | 4908911 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10246955 | |
| Uppercase Letter | 1952392 | 13.2% |
| Space Separator | 1643967 | 11.1% |
| Other Punctuation | 836703 | 5.6% |
| Close Punctuation | 63011 | 0.4% |
| Open Punctuation | 63011 | 0.4% |
| Dash Punctuation | 30878 | 0.2% |
| Modifier Letter | 813 | < 0.1% |
| Decimal Number | 169 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1472182 | |
| t | 1108058 | |
| e | 1084394 | |
| i | 1040753 | |
| n | 860928 | |
| o | 731207 | 7.1% |
| r | 620008 | 6.1% |
| s | 541802 | 5.3% |
| c | 500877 | 4.9% |
| l | 396581 | 3.9% |
| Other values (36) | 1890165 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 344369 | |
| N | 206279 | |
| A | 200821 | |
| C | 181308 | |
| U | 160228 | |
| P | 159029 | |
| M | 93185 | 4.8% |
| O | 87914 | 4.5% |
| B | 72819 | 3.7% |
| I | 68288 | 3.5% |
| Other values (20) | 378152 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 825692 | |
| . | 7803 | 0.9% |
| ' | 2811 | 0.3% |
| ? | 201 | < 0.1% |
| / | 190 | < 0.1% |
| * | 5 | < 0.1% |
| ; | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 108 | |
| 1 | 24 | 14.2% |
| 2 | 16 | 9.5% |
| 9 | 13 | 7.7% |
| 0 | 8 | 4.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30862 | |
| – | 10 | < 0.1% |
| — | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 61090 | |
| ) | 1921 | 3.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 61090 | |
| ( | 1921 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1643967 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 813 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12199347 | |
| Common | 2638555 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1472182 | 12.1% |
| t | 1108058 | 9.1% |
| e | 1084394 | 8.9% |
| i | 1040753 | 8.5% |
| n | 860928 | 7.1% |
| o | 731207 | 6.0% |
| r | 620008 | 5.1% |
| s | 541802 | 4.4% |
| c | 500877 | 4.1% |
| l | 396581 | 3.3% |
| Other values (66) | 3842557 |
Common
| Value | Count | Frequency (%) |
| (space) | 1643967 | |
| , | 825692 | |
| ] | 61090 | 2.3% |
| [ | 61090 | 2.3% |
| - | 30862 | 1.2% |
| . | 7803 | 0.3% |
| ' | 2811 | 0.1% |
| ) | 1921 | 0.1% |
| ( | 1921 | 0.1% |
| ʻ | 813 | < 0.1% |
| Other values (12) | 585 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14824256 | |
| None | 12817 | 0.1% |
| Modifier Letters | 813 | < 0.1% |
| Punctuation | 16 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1643967 | 11.1% |
| a | 1472182 | 9.9% |
| t | 1108058 | 7.5% |
| e | 1084394 | 7.3% |
| i | 1040753 | 7.0% |
| n | 860928 | 5.8% |
| , | 825692 | 5.6% |
| o | 731207 | 4.9% |
| r | 620008 | 4.2% |
| s | 541802 | 3.7% |
| Other values (61) | 4895265 |
None
| Value | Count | Frequency (%) |
| é | 3472 | |
| í | 2109 | |
| ã | 1904 | |
| Î | 1377 | 10.7% |
| ó | 1025 | 8.0% |
| ā | 813 | 6.3% |
| ç | 805 | 6.3% |
| á | 431 | 3.4% |
| ä | 239 | 1.9% |
| ö | 194 | 1.5% |
| Other values (14) | 448 | 3.5% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 813 |
Punctuation
| Value | Count | Frequency (%) |
| – | 10 | |
| — | 6 |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 57738 |
| Missing (%) | 17.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.55335716 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | OCEANIA |
| 3rd row | ASIA |
| 4th row | AFRICA |
| 5th row | OCEANIA |
| Value | Count | Frequency (%) |
| north_america | 154617 | |
| oceania | 41626 | 14.8% |
| asia | 32094 | 11.4% |
| south_america | 30956 | 11.0% |
| africa | 17455 | 6.2% |
| europe | 3580 | 1.3% |
| antarctica | 28 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 553580 | |
| R | 361253 | |
| I | 276776 | |
| C | 244710 | |
| E | 234359 | |
| O | 230779 | |
| N | 196271 | 6.6% |
| T | 185629 | 6.3% |
| H | 185573 | 6.3% |
| _ | 185573 | 6.3% |
| Other values (5) | 304194 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2773124 | |
| Connector Punctuation | 185573 | 6.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 553580 | |
| R | 361253 | |
| I | 276776 | |
| C | 244710 | |
| E | 234359 | |
| O | 230779 | |
| N | 196271 | 7.1% |
| T | 185629 | 6.7% |
| H | 185573 | 6.7% |
| M | 185573 | 6.7% |
| Other values (4) | 118621 | 4.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 185573 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2773124 | |
| Common | 185573 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 553580 | |
| R | 361253 | |
| I | 276776 | |
| C | 244710 | |
| E | 234359 | |
| O | 230779 | |
| N | 196271 | 7.1% |
| T | 185629 | 6.7% |
| H | 185573 | 6.7% |
| M | 185573 | 6.7% |
| Other values (4) | 118621 | 4.3% |
Common
| Value | Count | Frequency (%) |
| _ | 185573 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2958697 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 553580 | |
| R | 361253 | |
| I | 276776 | |
| C | 244710 | |
| E | 234359 | |
| O | 230779 | |
| N | 196271 | 6.6% |
| T | 185629 | 6.3% |
| H | 185573 | 6.3% |
| _ | 185573 | 6.3% |
| Other values (5) | 304194 |
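The "Most occurring categories" tables in this report group characters by Unicode general category (for example Uppercase Letter, Connector Punctuation). As a minimal sketch of how such counts can be reproduced, assuming the column is available as a plain list of strings with `None` for missing cells, Python's standard `unicodedata` module can be used. The function name `category_counts` is hypothetical, and this is not necessarily how the profiling tool computes its figures.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category across non-missing strings."""
    counts = Counter()
    for value in values:
        if value is None:  # skip missing cells
            continue
        for ch in value:
            # unicodedata.category returns two-letter codes such as
            # "Lu" (Uppercase Letter) or "Pc" (Connector Punctuation).
            counts[unicodedata.category(ch)] += 1
    return counts


# Illustrative input only, taken from the sample rows of the continent column.
counts = category_counts(["NORTH_AMERICA", None, "ASIA"])
print(counts)
```

The two-letter codes map to the long names used in the tables: `Lu` is Uppercase Letter, `Ll` is Lowercase Letter, `Zs` is Space Separator, and `Pc` is Connector Punctuation.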
waterBody
Text
Missing 
| Distinct | 217 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 231346 |
| Missing (%) | 68.4% |
| Memory size | 2.6 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 53 |
| Mean length | 20.41937085 |
| Min length | 6 |
Unique
| Unique | 17 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North Pacific Ocean, Gulf of California |
|---|---|
| 2nd row | South Pacific Ocean |
| 3rd row | North Atlantic Ocean |
| 4th row | Pacific |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 69162 | |
| pacific | 61578 | |
| north | 47089 | |
| atlantic | 41318 | |
| south | 18400 | 5.6% |
| sea | 18234 | 5.6% |
| caribbean | 14724 | 4.5% |
| bay | 12118 | 3.7% |
| gulf | 7267 | 2.2% |
| of | 6749 | 2.1% |
| Other values (198) | 31204 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 263495 | |
| c | 237624 | |
| (space) | 221095 | 10.1% |
| i | 195151 | 9.0% |
| t | 152988 | 7.0% |
| n | 144138 | 6.6% |
| e | 133315 | 6.1% |
| o | 87930 | 4.0% |
| f | 78762 | 3.6% |
| h | 75874 | 3.5% |
| Other values (45) | 589355 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1598402 | |
| Uppercase Letter | 321268 | 14.7% |
| Space Separator | 221095 | 10.1% |
| Other Punctuation | 38149 | 1.8% |
| Modifier Letter | 813 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 263495 | |
| c | 237624 | |
| i | 195151 | |
| t | 152988 | |
| n | 144138 | |
| e | 133315 | |
| o | 87930 | 5.5% |
| f | 78762 | 4.9% |
| h | 75874 | 4.7% |
| r | 70327 | 4.4% |
| Other values (16) | 158798 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 69662 | |
| P | 64644 | |
| N | 47095 | |
| A | 41986 | |
| S | 38909 | |
| C | 22284 | 6.9% |
| B | 14232 | 4.4% |
| G | 7323 | 2.3% |
| K | 4833 | 1.5% |
| M | 3987 | 1.2% |
| Other values (13) | 6313 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 38070 | |
| ' | 73 | 0.2% |
| . | 5 | < 0.1% |
| ; | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 221095 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 813 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1919670 | |
| Common | 260057 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 263495 | |
| c | 237624 | |
| i | 195151 | |
| t | 152988 | 8.0% |
| n | 144138 | 7.5% |
| e | 133315 | 6.9% |
| o | 87930 | 4.6% |
| f | 78762 | 4.1% |
| h | 75874 | 4.0% |
| r | 70327 | 3.7% |
| Other values (39) | 480066 |
Common
| Value | Count | Frequency (%) |
| (space) | 221095 | |
| , | 38070 | 14.6% |
| ʻ | 813 | 0.3% |
| ' | 73 | < 0.1% |
| . | 5 | < 0.1% |
| ; | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2178101 | |
| None | 813 | < 0.1% |
| Modifier Letters | 813 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 263495 | |
| c | 237624 | |
| (space) | 221095 | 10.2% |
| i | 195151 | 9.0% |
| t | 152988 | 7.0% |
| n | 144138 | 6.6% |
| e | 133315 | 6.1% |
| o | 87930 | 4.0% |
| f | 78762 | 3.6% |
| h | 75874 | 3.5% |
| Other values (43) | 587729 |
None
| Value | Count | Frequency (%) |
| ā | 813 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 813 |
islandGroup
Text
Missing 
| Distinct | 100 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 315374 |
| Missing (%) | 93.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 14.50514965 |
| Min length | 5 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Society Islands |
|---|---|
| 2nd row | Leeward Antilles |
| 3rd row | Bahama Islands |
| 4th row | Society Islands |
| 5th row | Visayas |
| Value | Count | Frequency (%) |
| islands | 15107 | |
| society | 10375 | |
| leeward | 3580 | 7.6% |
| antilles | 3191 | 6.7% |
| îles | 1360 | 2.9% |
| vent | 1360 | 2.9% |
| du | 1300 | 2.7% |
| cays | 1105 | 2.3% |
| bahama | 989 | 2.1% |
| group | 827 | 1.7% |
| Other values (103) | 8209 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 39205 | |
| a | 28802 | 8.7% |
| e | 28031 | 8.5% |
| (space) | 24683 | 7.5% |
| l | 24467 | 7.4% |
| n | 22700 | 6.9% |
| d | 21934 | 6.7% |
| i | 17625 | 5.3% |
| t | 16043 | 4.9% |
| I | 15495 | 4.7% |
| Other values (41) | 90572 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 256862 | |
| Uppercase Letter | 47651 | 14.5% |
| Space Separator | 24683 | 7.5% |
| Other Punctuation | 361 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 39205 | |
| a | 28802 | |
| e | 28031 | |
| l | 24467 | |
| n | 22700 | |
| d | 21934 | |
| i | 17625 | |
| t | 16043 | |
| o | 12680 | 4.9% |
| y | 11936 | 4.6% |
| Other values (15) | 33439 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 15495 | |
| S | 11283 | |
| L | 4421 | 9.3% |
| A | 4266 | 9.0% |
| V | 2159 | 4.5% |
| B | 2058 | 4.3% |
| C | 2045 | 4.3% |
| Î | 1360 | 2.9% |
| P | 926 | 1.9% |
| G | 916 | 1.9% |
| Other values (13) | 2722 | 5.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 353 | |
| ' | 8 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 24683 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 304513 | |
| Common | 25044 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 39205 | |
| a | 28802 | 9.5% |
| e | 28031 | 9.2% |
| l | 24467 | 8.0% |
| n | 22700 | 7.5% |
| d | 21934 | 7.2% |
| i | 17625 | 5.8% |
| t | 16043 | 5.3% |
| I | 15495 | 5.1% |
| o | 12680 | 4.2% |
| Other values (38) | 77531 |
Common
| Value | Count | Frequency (%) |
| (space) | 24683 | |
| . | 353 | 1.4% |
| ' | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 328197 | |
| None | 1360 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 39205 | |
| a | 28802 | 8.8% |
| e | 28031 | 8.5% |
| (space) | 24683 | 7.5% |
| l | 24467 | 7.5% |
| n | 22700 | 6.9% |
| d | 21934 | 6.7% |
| i | 17625 | 5.4% |
| t | 16043 | 4.9% |
| I | 15495 | 4.7% |
| Other values (40) | 89212 |
None
| Value | Count | Frequency (%) |
| Î | 1360 |
island
Text
Missing 
| Distinct | 566 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 279260 |
| Missing (%) | 82.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 25 |
| Mean length | 8.431383214 |
| Min length | 3 |
Unique
| Unique | 36 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Moorea |
|---|---|
| 2nd row | Moorea |
| 3rd row | Mindanao |
| 4th row | Klein Curacao |
| 5th row | Moorea |
| Value | Count | Frequency (%) |
| moorea | 15941 | |
| cay | 7341 | 8.5% |
| carrie | 4785 | 5.5% |
| bow | 4785 | 5.5% |
| island | 4062 | 4.7% |
| curacao | 3674 | 4.3% |
| oahu | 2249 | 2.6% |
| luzon | 2088 | 2.4% |
| borneo | 2043 | 2.4% |
| atoll | 914 | 1.1% |
| Other values (560) | 38461 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 78531 | |
| o | 63637 | |
| r | 44894 | 9.1% |
| e | 38281 | 7.7% |
| (space) | 27509 | 5.5% |
| u | 21061 | 4.2% |
| n | 20954 | 4.2% |
| i | 20832 | 4.2% |
| C | 19923 | 4.0% |
| M | 19688 | 4.0% |
| Other values (52) | 140742 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 379929 | |
| Uppercase Letter | 86074 | 17.4% |
| Space Separator | 27509 | 5.5% |
| Close Punctuation | 801 | 0.2% |
| Open Punctuation | 801 | 0.2% |
| Other Punctuation | 780 | 0.2% |
| Dash Punctuation | 158 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 78531 | |
| o | 63637 | |
| r | 44894 | |
| e | 38281 | |
| u | 21061 | 5.5% |
| n | 20954 | 5.5% |
| i | 20832 | 5.5% |
| l | 12002 | 3.2% |
| s | 11378 | 3.0% |
| y | 10662 | 2.8% |
| Other values (19) | 57697 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 19923 | |
| M | 19688 | |
| B | 8928 | |
| I | 4675 | 5.4% |
| T | 4460 | 5.2% |
| S | 3543 | 4.1% |
| L | 3257 | 3.8% |
| H | 2574 | 3.0% |
| O | 2570 | 3.0% |
| P | 2469 | 2.9% |
| Other values (16) | 13987 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 705 | |
| . | 73 | 9.4% |
| , | 2 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27509 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 801 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 801 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 158 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 466003 | |
| Common | 30049 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 78531 | |
| o | 63637 | |
| r | 44894 | 9.6% |
| e | 38281 | 8.2% |
| u | 21061 | 4.5% |
| n | 20954 | 4.5% |
| i | 20832 | 4.5% |
| C | 19923 | 4.3% |
| M | 19688 | 4.2% |
| l | 12002 | 2.6% |
| Other values (45) | 126200 |
Common
| Value | Count | Frequency (%) |
| (space) | 27509 | |
| ] | 801 | 2.7% |
| [ | 801 | 2.7% |
| ' | 705 | 2.3% |
| - | 158 | 0.5% |
| . | 73 | 0.2% |
| , | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 495602 | |
| None | 450 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 78531 | |
| o | 63637 | |
| r | 44894 | 9.1% |
| e | 38281 | 7.7% |
| (space) | 27509 | 5.6% |
| u | 21061 | 4.2% |
| n | 20954 | 4.2% |
| i | 20832 | 4.2% |
| C | 19923 | 4.0% |
| M | 19688 | 4.0% |
| Other values (47) | 140292 |
None
| Value | Count | Frequency (%) |
| ç | 380 | |
| ó | 34 | 7.6% |
| ò | 19 | 4.2% |
| Î | 14 | 3.1% |
| Ž | 3 | 0.7% |
countryCode
Text
Missing 
| Distinct | 203 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11127 |
| Missing (%) | 3.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | MX |
| 3rd row | PF |
| 4th row | US |
| 5th row | CN |
| Value | Count | Frequency (%) |
| us | 150956 | |
| pf | 22993 | 7.0% |
| mx | 10948 | 3.3% |
| pa | 9204 | 2.8% |
| bz | 9189 | 2.8% |
| mm | 8052 | 2.5% |
| ph | 6777 | 2.1% |
| gy | 5990 | 1.8% |
| pg | 4467 | 1.4% |
| cw | 4291 | 1.3% |
| Other values (193) | 94100 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 158858 | |
| U | 156371 | |
| P | 51767 | 7.9% |
| M | 36656 | 5.6% |
| F | 27494 | 4.2% |
| C | 25318 | 3.9% |
| A | 21589 | 3.3% |
| G | 20634 | 3.2% |
| B | 18213 | 2.8% |
| Z | 16360 | 2.5% |
| Other values (16) | 120674 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 653934 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 158858 | |
| U | 156371 | |
| P | 51767 | 7.9% |
| M | 36656 | 5.6% |
| F | 27494 | 4.2% |
| C | 25318 | 3.9% |
| A | 21589 | 3.3% |
| G | 20634 | 3.2% |
| B | 18213 | 2.8% |
| Z | 16360 | 2.5% |
| Other values (16) | 120674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 653934 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 158858 | |
| U | 156371 | |
| P | 51767 | 7.9% |
| M | 36656 | 5.6% |
| F | 27494 | 4.2% |
| C | 25318 | 3.9% |
| A | 21589 | 3.3% |
| G | 20634 | 3.2% |
| B | 18213 | 2.8% |
| Z | 16360 | 2.5% |
| Other values (16) | 120674 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 653934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 158858 | |
| U | 156371 | |
| P | 51767 | 7.9% |
| M | 36656 | 5.6% |
| F | 27494 | 4.2% |
| C | 25318 | 3.9% |
| A | 21589 | 3.3% |
| G | 20634 | 3.2% |
| B | 18213 | 2.8% |
| Z | 16360 | 2.5% |
| Other values (16) | 120674 |
stateProvince
Text
Missing 
| Distinct | 1646 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 66137 |
| Missing (%) | 19.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 52 |
|---|---|
| Median length | 42 |
| Mean length | 9.616295959 |
| Min length | 3 |
Unique
| Unique | 68 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arizona |
|---|---|
| 2nd row | Arkansas |
| 3rd row | Xizang |
| 4th row | Laikipia |
| 5th row | Florida |
| Value | Count | Frequency (%) |
| california | 17057 | 4.6% |
| florida | 16471 | 4.4% |
| texas | 14319 | 3.9% |
| virginia | 13034 | 3.5% |
| not | 10630 | 2.9% |
| stated | 10630 | 2.9% |
| arizona | 9677 | 2.6% |
| carolina | 8845 | 2.4% |
| region | 8363 | 2.3% |
| new | 8067 | 2.2% |
| Other values (1667) | 253487 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 361404 | |
| i | 256979 | 9.8% |
| n | 192389 | 7.4% |
| o | 190990 | 7.3% |
| r | 175132 | 6.7% |
| e | 143513 | 5.5% |
| s | 116807 | 4.5% |
| t | 109006 | 4.2% |
| l | 104505 | 4.0% |
| (space) | 98623 | 3.8% |
| Other values (72) | 865871 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2112662 | |
| Uppercase Letter | 370786 | 14.2% |
| Space Separator | 98623 | 3.8% |
| Open Punctuation | 10906 | 0.4% |
| Close Punctuation | 10906 | 0.4% |
| Dash Punctuation | 8610 | 0.3% |
| Other Punctuation | 2605 | 0.1% |
| Decimal Number | 121 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 361404 | |
| i | 256979 | |
| n | 192389 | |
| o | 190990 | |
| r | 175132 | |
| e | 143513 | 6.8% |
| s | 116807 | 5.5% |
| t | 109006 | 5.2% |
| l | 104505 | 4.9% |
| u | 68862 | 3.3% |
| Other values (31) | 393075 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 47905 | |
| T | 35455 | 9.6% |
| S | 34249 | 9.2% |
| N | 33647 | 9.1% |
| M | 32034 | 8.6% |
| A | 24596 | 6.6% |
| F | 18709 | 5.0% |
| V | 16282 | 4.4% |
| P | 16173 | 4.4% |
| I | 12002 | 3.2% |
| Other values (18) | 99734 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2197 | |
| ' | 211 | 8.1% |
| / | 93 | 3.6% |
| , | 61 | 2.3% |
| ? | 43 | 1.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 10606 | |
| ( | 300 | 2.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 10606 | |
| ) | 300 | 2.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 108 | |
| 9 | 13 | 10.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 98623 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8610 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2483448 | |
| Common | 131771 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 361404 | |
| i | 256979 | 10.3% |
| n | 192389 | 7.7% |
| o | 190990 | 7.7% |
| r | 175132 | 7.1% |
| e | 143513 | 5.8% |
| s | 116807 | 4.7% |
| t | 109006 | 4.4% |
| l | 104505 | 4.2% |
| u | 68862 | 2.8% |
| Other values (59) | 763861 |
Common
| Value | Count | Frequency (%) |
| (space) | 98623 | |
| [ | 10606 | 8.0% |
| ] | 10606 | 8.0% |
| - | 8610 | 6.5% |
| . | 2197 | 1.7% |
| ( | 300 | 0.2% |
| ) | 300 | 0.2% |
| ' | 211 | 0.2% |
| 3 | 108 | 0.1% |
| / | 93 | 0.1% |
| Other values (3) | 117 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2608986 | |
| None | 6233 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 361404 | |
| i | 256979 | 9.8% |
| n | 192389 | 7.4% |
| o | 190990 | 7.3% |
| r | 175132 | 6.7% |
| e | 143513 | 5.5% |
| s | 116807 | 4.5% |
| t | 109006 | 4.2% |
| l | 104505 | 4.0% |
| (space) | 98623 | 3.8% |
| Other values (55) | 859638 |
None
| Value | Count | Frequency (%) |
| é | 2424 | |
| ã | 977 | |
| ó | 950 | 15.2% |
| í | 867 | 13.9% |
| á | 390 | 6.3% |
| ä | 239 | 3.8% |
| ö | 185 | 3.0% |
| ñ | 88 | 1.4% |
| ô | 45 | 0.7% |
| ü | 17 | 0.3% |
| Other values (7) | 51 | 0.8% |
county
Text
Missing 
| Distinct | 3053 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 140475 |
| Missing (%) | 41.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 35 |
| Mean length | 10.83467683 |
| Min length | 1 |
Unique
| Unique | 295 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Cochise |
|---|---|
| 2nd row | Nielamu (Nyalam) Xian |
| 3rd row | [Not Stated] |
| 4th row | [Not Stated] |
| 5th row | [Not Stated] |
| Value | Count | Frequency (%) |
| not | 49620 | 15.0% |
| stated | 49620 | 15.0% |
| county | 38478 | 11.6% |
| honolulu | 5034 | 1.5% |
| san | 4615 | 1.4% |
| st | 3587 | 1.1% |
| cochise | 3337 | 1.0% |
| lucie | 3224 | 1.0% |
| island | 2682 | 0.8% |
| xian | 2350 | 0.7% |
| Other values (2542) | 168921 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 236676 | 11.1% |
| o | 187861 | 8.8% |
| a | 183049 | 8.5% |
| e | 150027 | 7.0% |
| n | 138728 | 6.5% |
| (space) | 133849 | 6.3% |
| u | 85231 | 4.0% |
| i | 84530 | 3.9% |
| d | 76606 | 3.6% |
| r | 76271 | 3.6% |
| Other values (73) | 788310 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1567038 | |
| Uppercase Letter | 329801 | 15.4% |
| Space Separator | 133849 | 6.3% |
| Open Punctuation | 50849 | 2.4% |
| Close Punctuation | 50849 | 2.4% |
| Other Punctuation | 6646 | 0.3% |
| Dash Punctuation | 2055 | 0.1% |
| Decimal Number | 48 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 236676 | |
| o | 187861 | |
| a | 183049 | |
| e | 150027 | |
| n | 138728 | |
| u | 85231 | 5.4% |
| i | 84530 | 5.4% |
| d | 76606 | 4.9% |
| r | 76271 | 4.9% |
| l | 60245 | 3.8% |
| Other values (28) | 287814 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 72930 | |
| C | 59829 | |
| N | 54898 | |
| H | 14147 | 4.3% |
| B | 13868 | 4.2% |
| M | 13860 | 4.2% |
| P | 13308 | 4.0% |
| L | 12983 | 3.9% |
| A | 12161 | 3.7% |
| D | 9867 | 3.0% |
| Other values (18) | 51950 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4070 | |
| ' | 1743 | |
| , | 625 | 9.4% |
| ? | 107 | 1.6% |
| / | 96 | 1.4% |
| * | 5 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 24 | |
| 2 | 16 | |
| 0 | 8 | 16.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 49629 | |
| ( | 1220 | 2.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 49629 | |
| ) | 1220 | 2.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2045 | |
| – | 10 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 133849 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1896839 | |
| Common | 244299 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 236676 | 12.5% |
| o | 187861 | 9.9% |
| a | 183049 | 9.7% |
| e | 150027 | 7.9% |
| n | 138728 | 7.3% |
| u | 85231 | 4.5% |
| i | 84530 | 4.5% |
| d | 76606 | 4.0% |
| r | 76271 | 4.0% |
| S | 72930 | 3.8% |
| Other values (56) | 604930 |
Common
| Value | Count | Frequency (%) |
| (space) | 133849 | |
| [ | 49629 | 20.3% |
| ] | 49629 | 20.3% |
| . | 4070 | 1.7% |
| - | 2045 | 0.8% |
| ' | 1743 | 0.7% |
| ) | 1220 | 0.5% |
| ( | 1220 | 0.5% |
| , | 625 | 0.3% |
| ? | 107 | < 0.1% |
| Other values (7) | 162 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2140252 | |
| None | 876 | < 0.1% |
| Punctuation | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 236676 | 11.1% |
| o | 187861 | 8.8% |
| a | 183049 | 8.6% |
| e | 150027 | 7.0% |
| n | 138728 | 6.5% |
| (space) | 133849 | 6.3% |
| u | 85231 | 4.0% |
| i | 84530 | 3.9% |
| d | 76606 | 3.6% |
| r | 76271 | 3.6% |
| Other values (58) | 787424 |
None
| Value | Count | Frequency (%) |
| í | 360 | |
| ü | 153 | |
| é | 136 | 15.5% |
| ã | 45 | 5.1% |
| á | 41 | 4.7% |
| ó | 38 | 4.3% |
| â | 32 | 3.7% |
| ç | 25 | 2.9% |
| ô | 15 | 1.7% |
| ö | 9 | 1.0% |
| Other values (4) | 22 | 2.5% |
Punctuation
| Value | Count | Frequency (%) |
| – | 10 |
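The word counts above list `not` and `stated` 49620 times each, and three of the five sample rows read `[Not Stated]`, so a sizeable share of the non-missing county values appears to be a placeholder rather than a place name. A minimal sketch of how such placeholders could be folded into the missing count before analysis is shown below. The placeholder set and the function names `clean_place_name` and `missing_share` are assumptions made for illustration, not part of the dataset or the profiling tool.

```python
# Assumed placeholder spellings; extend after inspecting the real column.
PLACEHOLDERS = {"[not stated]", "not stated"}


def clean_place_name(value):
    """Return a stripped place name, or None for missing or placeholder cells."""
    if value is None:
        return None
    text = value.strip()
    if not text or text.casefold() in PLACEHOLDERS:
        return None
    return text


def missing_share(values):
    """Fraction of cells that are missing once placeholders are treated as missing."""
    cleaned = [clean_place_name(v) for v in values]
    return sum(v is None for v in cleaned) / len(cleaned)


# Illustrative input only: the five sample rows shown for the county column.
sample = [
    "Cochise",
    "Nielamu (Nyalam) Xian",
    "[Not Stated]",
    "[Not Stated]",
    "[Not Stated]",
]
print(missing_share(sample))
```

Applied to the full column, the same normalization would raise the effective missing rate above the 41.5% reported here, since the report counts `[Not Stated]` as a present value.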
locality
Text
Missing 
| Distinct | 31944 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 34045 |
| Missing (%) | 10.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 312 |
|---|---|
| Median length | 249 |
| Mean length | 40.81951593 |
| Min length | 3 |
Unique
| Unique | 4485 |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Carr Canyon, Huachuca Mountains |
|---|---|
| 2nd row | Society Islands, Moorea, In front of Hilton |
| 3rd row | Ashdown |
| 4th row | Nielamu Zhen. Route 318 between Zhangmu and Nielamu (Nyalam) ca. 8 km from Zhangmu. |
| 5th row | Mpala Research Centre |
| Value | Count | Frequency (%) |
| of | 95416 | 4.7% |
| km | 27888 | 1.4% |
| road | 25983 | 1.3% |
| on | 20774 | 1.0% |
| island | 19621 | 1.0% |
| and | 19459 | 1.0% |
| national | 18145 | 0.9% |
| river | 17516 | 0.9% |
| creek | 15244 | 0.8% |
| at | 14855 | 0.7% |
| Other values (27242) | 1755591 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1726443 | 13.9% |
| a | 1102256 | 8.9% |
| e | 888496 | 7.2% |
| o | 818946 | 6.6% |
| n | 661740 | 5.3% |
| i | 647279 | 5.2% |
| r | 607431 | 4.9% |
| t | 591988 | 4.8% |
| l | 448950 | 3.6% |
| s | 433370 | 3.5% |
| Other values (123) | 4484234 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8489111 | |
| Space Separator | 1726443 | 13.9% |
| Uppercase Letter | 1410083 | 11.4% |
| Other Punctuation | 435833 | 3.5% |
| Decimal Number | 258754 | 2.1% |
| Close Punctuation | 32192 | 0.3% |
| Open Punctuation | 32178 | 0.3% |
| Dash Punctuation | 20746 | 0.2% |
| Other Symbol | 2899 | < 0.1% |
| Math Symbol | 1876 | < 0.1% |
| Other values (7) | 1018 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1102256 | |
| e | 888496 | |
| o | 818946 | |
| n | 661740 | 7.8% |
| i | 647279 | 7.6% |
| r | 607431 | 7.2% |
| t | 591988 | 7.0% |
| l | 448950 | 5.3% |
| s | 433370 | 5.1% |
| u | 303856 | 3.6% |
| Other values (44) | 1984799 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 160087 | 11.4% |
| C | 145294 | 10.3% |
| M | 103557 | 7.3% |
| B | 101524 | 7.2% |
| R | 99658 | 7.1% |
| P | 98501 | 7.0% |
| N | 89448 | 6.3% |
| I | 60353 | 4.3% |
| A | 59240 | 4.2% |
| L | 54573 | 3.9% |
| Other values (24) | 437848 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 297675 | |
| . | 103227 | 23.7% |
| ' | 10607 | 2.4% |
| ; | 7916 | 1.8% |
| " | 4321 | 1.0% |
| : | 4145 | 1.0% |
| / | 3509 | 0.8% |
| # | 2991 | 0.7% |
| & | 669 | 0.2% |
| @ | 609 | 0.1% |
| Other values (2) | 164 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 50565 | |
| 2 | 34503 | |
| 0 | 34082 | |
| 5 | 31120 | |
| 3 | 25450 | |
| 4 | 21075 | |
| 6 | 17830 | 6.9% |
| 7 | 16310 | 6.3% |
| 9 | 14516 | 5.6% |
| 8 | 13303 | 5.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1276 | |
| ~ | 431 | 23.0% |
| + | 123 | 6.6% |
| > | 35 | 1.9% |
| < | 8 | 0.4% |
| \| | 3 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 26298 | |
| [ | 5879 | 18.3% |
| ‚ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 26313 | |
| ] | 5879 | 18.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20738 | |
| – | 8 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2896 | |
| ™ | 3 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1726443 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 813 |
Other Letter
| Value | Count | Frequency (%) |
| º | 158 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 23 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 10 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 6 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 5 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9899352 | |
| Common | 2511781 | 20.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1102256 | 11.1% |
| e | 888496 | 9.0% |
| o | 818946 | 8.3% |
| n | 661740 | 6.7% |
| i | 647279 | 6.5% |
| r | 607431 | 6.1% |
| t | 591988 | 6.0% |
| l | 448950 | 4.5% |
| s | 433370 | 4.4% |
| u | 303856 | 3.1% |
| Other values (79) | 3395040 |
Common
| Value | Count | Frequency (%) |
| (space) | 1726443 | |
| , | 297675 | 11.9% |
| . | 103227 | 4.1% |
| 1 | 50565 | 2.0% |
| 2 | 34503 | 1.4% |
| 0 | 34082 | 1.4% |
| 5 | 31120 | 1.2% |
| ) | 26313 | 1.0% |
| ( | 26298 | 1.0% |
| 3 | 25450 | 1.0% |
| Other values (34) | 156105 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12400051 | |
| None | 10096 | 0.1% |
| Modifier Letters | 813 | < 0.1% |
| Latin Ext Additional | 142 | < 0.1% |
| Punctuation | 25 | < 0.1% |
| Currency Symbols | 3 | < 0.1% |
| Letterlike Symbols | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1726443 | 13.9% |
| a | 1102256 | 8.9% |
| e | 888496 | 7.2% |
| o | 818946 | 6.6% |
| n | 661740 | 5.3% |
| i | 647279 | 5.2% |
| r | 607431 | 4.9% |
| t | 591988 | 4.8% |
| l | 448950 | 3.6% |
| s | 433370 | 3.5% |
| Other values (77) | 4473152 |
None
| Value | Count | Frequency (%) |
| ° | 2896 | |
| è | 1904 | |
| é | 1026 | 10.2% |
| í | 1025 | 10.2% |
| ā | 813 | 8.1% |
| á | 677 | 6.7% |
| ó | 376 | 3.7% |
| ô | 224 | 2.2% |
| ã | 207 | 2.1% |
| ñ | 167 | 1.7% |
| Other values (24) | 781 | 7.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 813 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 56 | |
| ể | 56 | |
| ỏ | 10 | 7.0% |
| ả | 10 | 7.0% |
| ố | 10 | 7.0% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 10 | |
| – | 8 | |
| “ | 6 | |
| ‚ | 1 | 4.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 3 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 3 |
Missing 
| Distinct | 913 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 322170 |
| Missing (%) | 95.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 27 |
| Mean length | 6.528761618 |
| Min length | 1 |
Unique
| Unique | 165 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 760 m |
|---|---|
| 2nd row | 1050 ft |
| 3rd row | 611 m |
| 4th row | 73 m |
| 5th row | 500 ft |
| Value | Count | Frequency (%) |
| m | 8065 | |
| ft | 7360 | |
| ca | 904 | 2.6% |
|  | 503 | 1.5% |
| 50 | 384 | 1.1% |
| 3440 | 336 | 1.0% |
| sea | 323 | 0.9% |
| level | 323 | 0.9% |
| 54 | 313 | 0.9% |
| 80 | 302 | 0.9% |
| Other values (758) | 15653 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 18542 | |
| 0 | 15801 | |
| m | 8238 | 7.9% |
| t | 7837 | 7.5% |
| f | 7463 | 7.2% |
| 1 | 5477 | 5.3% |
| 5 | 4884 | 4.7% |
| 4 | 4548 | 4.4% |
| 3 | 4547 | 4.4% |
| 2 | 4229 | 4.1% |
| Other values (37) | 22398 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50308 | |
| Lowercase Letter | 31690 | |
| Space Separator | 18542 | 17.8% |
| Other Punctuation | 1208 | 1.2% |
| Dash Punctuation | 1026 | 1.0% |
| Uppercase Letter | 596 | 0.6% |
| Math Symbol | 364 | 0.4% |
| Open Punctuation | 115 | 0.1% |
| Close Punctuation | 115 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 8238 | |
| t | 7837 | |
| f | 7463 | |
| a | 1697 | 5.4% |
| e | 1622 | 5.1% |
| c | 1233 | 3.9% |
| l | 723 | 2.3% |
| s | 424 | 1.3% |
| r | 422 | 1.3% |
| v | 408 | 1.3% |
| Other values (12) | 1623 | 5.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15801 | |
| 1 | 5477 | 10.9% |
| 5 | 4884 | 9.7% |
| 4 | 4548 | 9.0% |
| 3 | 4547 | 9.0% |
| 2 | 4229 | 8.4% |
| 6 | 3499 | 7.0% |
| 8 | 3280 | 6.5% |
| 7 | 2220 | 4.4% |
| 9 | 1823 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1121 | |
| / | 70 | 5.8% |
| ? | 12 | 1.0% |
| ' | 4 | 0.3% |
| , | 1 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 197 | |
| P | 195 | |
| G | 195 | |
| L | 9 | 1.5% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 294 | |
| + | 70 | 19.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 18542 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1026 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 115 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 71678 | |
| Latin | 32286 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 8238 | |
| t | 7837 | |
| f | 7463 | |
| a | 1697 | 5.3% |
| e | 1622 | 5.0% |
| c | 1233 | 3.8% |
| l | 723 | 2.2% |
| s | 424 | 1.3% |
| r | 422 | 1.3% |
| v | 408 | 1.3% |
| Other values (16) | 2219 | 6.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 18542 | |
| 0 | 15801 | |
| 1 | 5477 | 7.6% |
| 5 | 4884 | 6.8% |
| 4 | 4548 | 6.3% |
| 3 | 4547 | 6.3% |
| 2 | 4229 | 5.9% |
| 6 | 3499 | 4.9% |
| 8 | 3280 | 4.6% |
| 7 | 2220 | 3.1% |
| Other values (11) | 4651 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103964 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 18542 | |
| 0 | 15801 | |
| m | 8238 | 7.9% |
| t | 7837 | 7.5% |
| f | 7463 | 7.2% |
| 1 | 5477 | 5.3% |
| 5 | 4884 | 4.7% |
| 4 | 4548 | 4.4% |
| 3 | 4547 | 4.4% |
| 2 | 4229 | 4.1% |
| Other values (37) | 22398 |
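The sample rows for this variable mix two units ("760 m", "1050 ft"), and the word counts show `m` 8065 times and `ft` 7360 times, so the values cannot be compared until they share a unit. The sketch below converts the simplest "number plus unit" form to meters and returns `None` for everything else (for example "sea level" or values prefixed with "ca"), leaving those for manual review. The function name `to_meters` and the accepted pattern are assumptions for illustration only.

```python
import re

FEET_TO_METERS = 0.3048  # exact by definition of the international foot

# Accepts only "<number> m" or "<number> ft", with optional surrounding spaces.
_PATTERN = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*(m|ft)\s*$", re.IGNORECASE)


def to_meters(text):
    """Convert strings such as '760 m' or '1050 ft' to meters, else return None."""
    if text is None:
        return None
    match = _PATTERN.match(text)
    if match is None:
        return None  # ranges, qualifiers and free text are left unparsed
    number = float(match.group(1))
    unit = match.group(2).lower()
    return number if unit == "m" else number * FEET_TO_METERS


# Illustrative input only, taken from the sample rows above.
for raw in ["760 m", "1050 ft", "sea level"]:
    print(raw, "->", to_meters(raw))
```

Returning `None` instead of guessing keeps ambiguous entries visible, which matters here because the column also contains ranges, `<` and `+` qualifiers, and question marks.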
verbatimDepth
Text
Missing 
| Distinct | 59 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 336615 |
| Missing (%) | 99.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 91 |
|---|---|
| Median length | 10 |
| Mean length | 8.625422583 |
| Min length | 2 |
Unique
| Unique | 29 |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | to 1 m |
|---|---|
| 2nd row | intertidal |
| 3rd row | <0.5 m |
| 4th row | intertidal |
| 5th row | intertidal |
| Value | Count | Frequency (%) |
| intertidal | 778 | |
| m | 259 | 13.5% |
| surface | 253 | 13.2% |
| to | 103 | 5.4% |
| 1 | 95 | 4.9% |
| 0-1 | 84 | 4.4% |
| intertida | 84 | 4.4% |
| 0.5 | 68 | 3.5% |
| 1m | 47 | 2.4% |
| cm | 13 | 0.7% |
| Other values (55) | 138 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | 7.0% |
| d | 877 | 6.9% |
| l | 806 | 6.3% |
| (space) | 443 | 3.5% |
| I | 353 | 2.8% |
| Other values (41) | 2672 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10679 | |
| Uppercase Letter | 644 | 5.0% |
| Decimal Number | 596 | 4.7% |
| Space Separator | 443 | 3.5% |
| Math Symbol | 161 | 1.3% |
| Other Punctuation | 108 | 0.8% |
| Dash Punctuation | 102 | 0.8% |
| Open Punctuation | 12 | 0.1% |
| Close Punctuation | 12 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | |
| d | 877 | |
| l | 806 | |
| m | 347 | 3.2% |
| c | 260 | 2.4% |
| Other values (12) | 783 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 252 | |
| 0 | 198 | |
| 5 | 82 | 13.8% |
| 2 | 29 | 4.9% |
| 3 | 12 | 2.0% |
| 4 | 5 | 0.8% |
| 6 | 5 | 0.8% |
| 8 | 5 | 0.8% |
| 9 | 4 | 0.7% |
| 7 | 4 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 353 | |
| S | 258 | |
| M | 12 | 1.9% |
| C | 10 | 1.6% |
| A | 5 | 0.8% |
| U | 4 | 0.6% |
| V | 2 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 81 | |
| : | 14 | 13.0% |
| " | 6 | 5.6% |
| , | 4 | 3.7% |
| ; | 3 | 2.8% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 106 | |
| + | 36 | 22.4% |
| ~ | 19 | 11.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 443 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 102 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11323 | |
| Common | 1434 | 11.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | |
| d | 877 | |
| l | 806 | |
| I | 353 | 3.1% |
| m | 347 | 3.1% |
| Other values (19) | 1334 |
Common
| Value | Count | Frequency (%) |
| (space) | 443 | |
| 1 | 252 | |
| 0 | 198 | |
| < | 106 | 7.4% |
| - | 102 | 7.1% |
| 5 | 82 | 5.7% |
| . | 81 | 5.6% |
| + | 36 | 2.5% |
| 2 | 29 | 2.0% |
| ~ | 19 | 1.3% |
| Other values (12) | 86 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12757 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | 7.0% |
| d | 877 | 6.9% |
| l | 806 | 6.3% |
| (space) | 443 | 3.5% |
| I | 353 | 2.8% |
| Other values (41) | 2672 |
minimumDistanceAboveSurfaceInMeters
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338092 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 30.5 |
| Mean length | 30.5 |
| Min length | 21 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Carpenter, Kent E.; Williams, Jeffrey T. |
|---|---|
| 2nd row | Kirkbride, J. H., Jr. |
| Value | Count | Frequency (%) |
| carpenter | 1 | |
| kent | 1 | |
| e | 1 | |
| williams | 1 | |
| jeffrey | 1 | |
| t | 1 | |
| kirkbride | 1 | |
| j | 1 | |
| h | 1 | |
| jr | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 8 | |
| r | 6 | 9.8% |
| e | 6 | 9.8% |
| . | 5 | 8.2% |
| i | 4 | 6.6% |
| , | 4 | 6.6% |
| J | 3 | 4.9% |
| l | 2 | 3.3% |
| a | 2 | 3.3% |
| K | 2 | 3.3% |
| Other values (16) | 19 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33 | |
| Other Punctuation | 10 | 16.4% |
| Uppercase Letter | 10 | 16.4% |
| Space Separator | 8 | 13.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 6 | |
| e | 6 | |
| i | 4 | |
| l | 2 | 6.1% |
| a | 2 | 6.1% |
| f | 2 | 6.1% |
| t | 2 | 6.1% |
| n | 2 | 6.1% |
| b | 1 | 3.0% |
| k | 1 | 3.0% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 3 | |
| K | 2 | |
| T | 1 | 10.0% |
| C | 1 | 10.0% |
| W | 1 | 10.0% |
| E | 1 | 10.0% |
| H | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 | |
| , | 4 | |
| ; | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43 | |
| Common | 18 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 6 | |
| e | 6 | |
| i | 4 | 9.3% |
| J | 3 | 7.0% |
| l | 2 | 4.7% |
| a | 2 | 4.7% |
| K | 2 | 4.7% |
| f | 2 | 4.7% |
| t | 2 | 4.7% |
| n | 2 | 4.7% |
| Other values (12) | 12 |
Common
| Value | Count | Frequency (%) |
| (space) | 8 | |
| . | 5 | |
| , | 4 | |
| ; | 1 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 8 | |
| r | 6 | 9.8% |
| e | 6 | 9.8% |
| . | 5 | 8.2% |
| i | 4 | 6.6% |
| , | 4 | 6.6% |
| J | 3 | 4.9% |
| l | 2 | 3.3% |
| a | 2 | 3.3% |
| K | 2 | 3.3% |
| Other values (16) | 19 |
decimalLatitude
Text
Missing 
| Distinct | 22660 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 73462 |
| Missing (%) | 21.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.760010127 |
| Min length | 3 |
Unique
| Unique | 3758 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 31.434 |
|---|---|
| 2nd row | 27.5772 |
| 3rd row | -17.4756 |
| 4th row | 28.0392 |
| 5th row | 0.293 |
| Value | Count | Frequency (%) |
| 12.0832 | 1365 | 0.5% |
| 16.802 | 1085 | 0.4% |
| 22.0 | 898 | 0.3% |
| 31.7306 | 892 | 0.3% |
| 5.0 | 791 | 0.3% |
| 17.4726 | 765 | 0.3% |
| 38.6141 | 726 | 0.3% |
| 34.9606 | 681 | 0.3% |
| 17.4825 | 679 | 0.3% |
| 9.82436 | 665 | 0.3% |
| Other values (22418) | 256085 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 264632 | |
| 3 | 225950 | |
| 1 | 184563 | |
| 2 | 163065 | |
| 7 | 157629 | |
| 4 | 151752 | |
| 8 | 134492 | |
| 5 | 130049 | |
| 6 | 125987 | |
| 9 | 108015 | |
| Other values (2) | 142781 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1471419 | |
| Other Punctuation | 264632 | 14.8% |
| Dash Punctuation | 52864 | 3.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 225950 | |
| 1 | 184563 | |
| 2 | 163065 | |
| 7 | 157629 | |
| 4 | 151752 | |
| 8 | 134492 | |
| 5 | 130049 | |
| 6 | 125987 | |
| 9 | 108015 | |
| 0 | 89917 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 264632 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52864 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1788915 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 264632 | |
| 3 | 225950 | |
| 1 | 184563 | |
| 2 | 163065 | |
| 7 | 157629 | |
| 4 | 151752 | |
| 8 | 134492 | |
| 5 | 130049 | |
| 6 | 125987 | |
| 9 | 108015 | |
| Other values (2) | 142781 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1788915 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 264632 | |
| 3 | 225950 | |
| 1 | 184563 | |
| 2 | 163065 | |
| 7 | 157629 | |
| 4 | 151752 | |
| 8 | 134492 | |
| 5 | 130049 | |
| 6 | 125987 | |
| 9 | 108015 | |
| Other values (2) | 142781 |
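The category names used in the character tables above (Decimal Number, Other Punctuation, Dash Punctuation) match the long names of the Unicode general categories `Nd`, `Po` and `Pd`. The following is a minimal sketch of how such a breakdown could be reproduced with Python's standard `unicodedata` module. It is not taken from the profiler's own code, and the helper name `category_counts` and the `CATEGORY_NAMES` mapping are assumptions made for illustration.

```python
import unicodedata
from collections import Counter

# Long names for the Unicode general categories that appear in this report.
# This mapping is illustrative and covers only the categories seen here.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Sm": "Math Symbol",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The first three sample rows of decimalLatitude shown above.
sample = ["31.434", "27.5772", "-17.4756"]
print(dict(category_counts(sample)))
```

Applied to the whole column rather than three sample rows, the same counting should give totals of the kind listed under "Most occurring categories".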
decimalLongitude
Text
Missing 
| Distinct | 21514 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 73462 |
| Missing (%) | 21.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 7.494804105 |
| Min length | 3 |
Unique
| Unique | 3512 |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | -110.285 |
|---|---|
| 2nd row | -111.45 |
| 3rd row | -149.842 |
| 4th row | 85.9858 |
| 5th row | 36.899 |
| Value | Count | Frequency (%) |
| 68.8991 | 1347 | 0.5% |
| 56.1167 | 1222 | 0.5% |
| 149.826 | 1218 | 0.5% |
| 88.082 | 1101 | 0.4% |
| 149.775 | 1056 | 0.4% |
| 110.881 | 910 | 0.3% |
| 88.0817 | 835 | 0.3% |
| 80.2986 | 742 | 0.3% |
| 90.2589 | 731 | 0.3% |
| 176.0 | 682 | 0.3% |
| Other values (21348) | 254788 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 264632 | |
| 1 | 242905 | |
| - | 217262 | |
| 8 | 175311 | |
| 7 | 174536 | |
| 9 | 158724 | |
| 6 | 132513 | |
| 4 | 129672 | |
| 2 | 128120 | |
| 5 | 123756 | |
| Other values (2) | 235934 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1501471 | |
| Other Punctuation | 264632 | 13.3% |
| Dash Punctuation | 217262 | 11.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 242905 | |
| 8 | 175311 | |
| 7 | 174536 | |
| 9 | 158724 | |
| 6 | 132513 | |
| 4 | 129672 | |
| 2 | 128120 | |
| 5 | 123756 | |
| 3 | 122039 | |
| 0 | 113895 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 264632 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 217262 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1983365 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 264632 | |
| 1 | 242905 | |
| - | 217262 | |
| 8 | 175311 | |
| 7 | 174536 | |
| 9 | 158724 | |
| 6 | 132513 | |
| 4 | 129672 | |
| 2 | 128120 | |
| 5 | 123756 | |
| Other values (2) | 235934 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1983365 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 264632 | |
| 1 | 242905 | |
| - | 217262 | |
| 8 | 175311 | |
| 7 | 174536 | |
| 9 | 158724 | |
| 6 | 132513 | |
| 4 | 129672 | |
| 2 | 128120 | |
| 5 | 123756 | |
| Other values (2) | 235934 |
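Both decimalLatitude and decimalLongitude are profiled as Text, so any numeric use of them needs an explicit conversion first. The sketch below shows one way this might be done with plain Python. The helper name `parse_coordinate`, the choice to return `None` for missing or invalid values, and the last two example rows are assumptions for illustration, not part of the dataset or the profiler.

```python
def parse_coordinate(text, limit):
    """Return the coordinate as a float, or None if it is missing,
    malformed, or outside the range [-limit, limit]."""
    if text is None or text.strip() == "":
        return None
    try:
        value = float(text)
    except ValueError:
        return None
    # NaN fails the range comparison, so it is rejected here as well.
    return value if -limit <= value <= limit else None


# The first two rows reuse sample values shown above; the last two are
# hypothetical rows added to exercise the missing and out-of-range paths.
rows = [
    ("31.434", "-110.285"),
    ("27.5772", "-111.45"),
    ("", "85.9858"),
    ("95.0", "36.899"),
]

for lat_text, lon_text in rows:
    lat = parse_coordinate(lat_text, 90.0)    # latitude range is -90..90
    lon = parse_coordinate(lon_text, 180.0)   # longitude range is -180..180
    print(lat, lon)
```

Returning `None` instead of raising keeps missing values, which make up 21.7% of each of these columns, distinguishable from parsed numbers without stopping the loop.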
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 453 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 327083 |
| Missing (%) | 96.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 5.133412043 |
| Min length | 3 |
Unique
| Unique | 40 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 500.0 |
|---|---|
| 2nd row | 500.0 |
| 3rd row | 140000.0 |
| 4th row | 100.0 |
| 5th row | 100.0 |
| Value | Count | Frequency (%) |
| 100.0 | 1571 | 14.3% |
| 5.0 | 435 | 4.0% |
| 14.0 | 400 | 3.6% |
| 12.0 | 386 | 3.5% |
| 500.0 | 365 | 3.3% |
| 10.0 | 311 | 2.8% |
| 32.0 | 277 | 2.5% |
| 200.0 | 273 | 2.5% |
| 15.0 | 255 | 2.3% |
| 23.0 | 231 | 2.1% |
| Other values (443) | 6507 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 17836 | |
| . | 11011 | |
| 1 | 6690 | 11.8% |
| 2 | 4823 | 8.5% |
| 5 | 3484 | 6.2% |
| 4 | 3318 | 5.9% |
| 3 | 2873 | 5.1% |
| 7 | 1789 | 3.2% |
| 8 | 1687 | 3.0% |
| 6 | 1652 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 45513 | |
| Other Punctuation | 11011 | 19.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 17836 | |
| 1 | 6690 | 14.7% |
| 2 | 4823 | 10.6% |
| 5 | 3484 | 7.7% |
| 4 | 3318 | 7.3% |
| 3 | 2873 | 6.3% |
| 7 | 1789 | 3.9% |
| 8 | 1687 | 3.7% |
| 6 | 1652 | 3.6% |
| 9 | 1361 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11011 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 56524 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 17836 | |
| . | 11011 | |
| 1 | 6690 | 11.8% |
| 2 | 4823 | 8.5% |
| 5 | 3484 | 6.2% |
| 4 | 3318 | 5.9% |
| 3 | 2873 | 5.1% |
| 7 | 1789 | 3.2% |
| 8 | 1687 | 3.0% |
| 6 | 1652 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56524 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 17836 | |
| . | 11011 | |
| 1 | 6690 | 11.8% |
| 2 | 4823 | 8.5% |
| 5 | 3484 | 6.2% |
| 4 | 3318 | 5.9% |
| 3 | 2873 | 5.1% |
| 7 | 1789 | 3.2% |
| 8 | 1687 | 3.0% |
| 6 | 1652 | 2.9% |
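The summary percentages for coordinateUncertaintyInMeters can be checked by hand. With 338094 observations in the dataset and 327083 missing, 11011 values are present, and the reported Distinct (%) and Unique (%) appear to be taken relative to that non-missing count rather than to all rows. The sketch below works through the arithmetic. The function name `summary_percentages` is hypothetical, and the choice of denominator is an inference from the figures in this report, not a documented rule.

```python
def summary_percentages(n_rows, n_missing, n_distinct, n_unique):
    """Recompute the headline percentages from raw counts.

    Assumes Missing (%) is relative to all rows, while Distinct (%) and
    Unique (%) are relative to the non-missing rows only.
    """
    n_present = n_rows - n_missing
    return {
        "missing_pct": round(100 * n_missing / n_rows, 1),
        "distinct_pct": round(100 * n_distinct / n_present, 1),
        "unique_pct": round(100 * n_unique / n_present, 1),
    }


# Counts reported for coordinateUncertaintyInMeters.
print(summary_percentages(338094, 327083, 453, 40))
# {'missing_pct': 96.7, 'distinct_pct': 4.1, 'unique_pct': 0.4}
```

The same relationship holds for decimalLatitude: 22660 distinct values over 264632 non-missing rows gives 8.6%.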
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2387143 |
|---|---|
| 2nd row | 2906907 |
| 3rd row | 2463461 |
| 4th row | 2974262 |
| Value | Count | Frequency (%) |
| 2387143 | 1 | |
| 2906907 | 1 | |
| 2463461 | 1 | |
| 2974262 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 329029 |
| Missing (%) | 97.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.7463872 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 8918 | |
| minutes | 8843 | |
| seconds | 8843 | |
| township | 107 | 0.4% |
| range | 107 | 0.4% |
| decimal | 75 | 0.3% |
| utm | 24 | 0.1% |
| unknown | 16 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 44622 | |
| s | 26711 | |
| n | 17948 | 8.7% |
| (space) | 17868 | 8.7% |
| g | 9025 | 4.4% |
| i | 9025 | 4.4% |
| D | 8982 | 4.4% |
| o | 8966 | 4.3% |
| c | 8918 | 4.3% |
| r | 8918 | 4.3% |
| Other values (15) | 45213 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161358 | |
| Uppercase Letter | 26970 | 13.1% |
| Space Separator | 17868 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 44622 | |
| s | 26711 | |
| n | 17948 | |
| g | 9025 | 5.6% |
| i | 9025 | 5.6% |
| o | 8966 | 5.6% |
| c | 8918 | 5.5% |
| r | 8918 | 5.5% |
| d | 8854 | 5.5% |
| u | 8843 | 5.5% |
| Other values (8) | 9528 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 8982 | |
| M | 8867 | |
| S | 8843 | |
| T | 131 | 0.5% |
| R | 107 | 0.4% |
| U | 40 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 17868 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 188328 | |
| Common | 17868 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 44622 | |
| s | 26711 | |
| n | 17948 | |
| g | 9025 | 4.8% |
| i | 9025 | 4.8% |
| D | 8982 | 4.8% |
| o | 8966 | 4.8% |
| c | 8918 | 4.7% |
| r | 8918 | 4.7% |
| M | 8867 | 4.7% |
| Other values (14) | 36346 |
Common
| Value | Count | Frequency (%) |
| (space) | 17868 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 206196 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 44622 | |
| s | 26711 | |
| n | 17948 | 8.7% |
| (space) | 17868 | 8.7% |
| g | 9025 | 4.4% |
| i | 9025 | 4.4% |
| D | 8982 | 4.4% |
| o | 8966 | 4.3% |
| c | 8918 | 4.3% |
| r | 8918 | 4.3% |
| Other values (15) | 45213 |
georeferencedBy
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 35 |
| Mean length | 32.25 |
| Min length | 19 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodon nudivittis (Ogilby, 1895) |
|---|---|
| 2nd row | Coccocypselum guianense (Aubl.) K.Schum. |
| 3rd row | Emoia caeruleocauda (De Vis, 1892) |
| 4th row | Dimorphandra Schott |
| Value | Count | Frequency (%) |
| champsodon | 1 | 6.7% |
| nudivittis | 1 | 6.7% |
| ogilby | 1 | 6.7% |
| 1895 | 1 | 6.7% |
| coccocypselum | 1 | 6.7% |
| guianense | 1 | 6.7% |
| aubl | 1 | 6.7% |
| k.schum | 1 | 6.7% |
| emoia | 1 | 6.7% |
| caeruleocauda | 1 | 6.7% |
| Other values (5) | 5 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 11 | 8.5% |
| i | 8 | 6.2% |
| a | 8 | 6.2% |
| o | 8 | 6.2% |
| c | 7 | 5.4% |
| u | 7 | 5.4% |
| e | 6 | 4.7% |
| m | 5 | 3.9% |
| s | 5 | 3.9% |
| n | 5 | 3.9% |
| Other values (27) | 59 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 88 | |
| Space Separator | 11 | 8.5% |
| Uppercase Letter | 11 | 8.5% |
| Decimal Number | 8 | 6.2% |
| Other Punctuation | 5 | 3.9% |
| Close Punctuation | 3 | 2.3% |
| Open Punctuation | 3 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 8 | 9.1% |
| a | 8 | 9.1% |
| o | 8 | 9.1% |
| c | 7 | 8.0% |
| u | 7 | 8.0% |
| e | 6 | 6.8% |
| m | 5 | 5.7% |
| s | 5 | 5.7% |
| n | 5 | 5.7% |
| l | 4 | 4.5% |
| Other values (9) | 25 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| D | 2 | |
| C | 2 | |
| A | 1 | |
| K | 1 | |
| E | 1 | |
| O | 1 | |
| V | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 8 | 2 | |
| 1 | 2 | |
| 5 | 1 | |
| 2 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 99 | |
| Common | 30 | 23.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 8 | 8.1% |
| a | 8 | 8.1% |
| o | 8 | 8.1% |
| c | 7 | 7.1% |
| u | 7 | 7.1% |
| e | 6 | 6.1% |
| m | 5 | 5.1% |
| s | 5 | 5.1% |
| n | 5 | 5.1% |
| l | 4 | 4.0% |
| Other values (17) | 36 |
Common
| Value | Count | Frequency (%) |
| (space) | 11 | |
| . | 3 | 10.0% |
| ) | 3 | 10.0% |
| ( | 3 | 10.0% |
| 9 | 2 | 6.7% |
| 8 | 2 | 6.7% |
| 1 | 2 | 6.7% |
| , | 2 | 6.7% |
| 5 | 1 | 3.3% |
| 2 | 1 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 11 | 8.5% |
| i | 8 | 6.2% |
| a | 8 | 6.2% |
| o | 8 | 6.2% |
| c | 7 | 5.4% |
| u | 7 | 5.4% |
| e | 6 | 4.7% |
| m | 5 | 3.9% |
| s | 5 | 3.9% |
| n | 5 | 3.9% |
| Other values (27) | 59 |
Missing 
| Distinct | 172 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 255273 |
| Missing (%) | 75.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 228 |
|---|---|
| Median length | 12 |
| Mean length | 16.00842781 |
| Min length | 3 |
Unique
| Unique | 10 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Google Earth |
|---|---|
| 2nd row | Google Earth |
| 3rd row | Google Earth |
| 4th row | GeoLocate |
| 5th row | Google Earth |
| Value | Count | Frequency (%) |
| 50688 | ||
| earth | 44693 | |
| gps | 24170 | 11.6% |
| maps | 6421 | 3.1% |
| georeferencing | 4994 | 2.4% |
| and | 3621 | 1.7% |
| pro | 3250 | 1.6% |
| for | 3177 | 1.5% |
| to | 3177 | 1.5% |
| best | 3176 | 1.5% |
| Other values (336) | 60464 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 139882 | 10.6% |
| (space) | 125010 | 9.4% |
| e | 116797 | 8.8% |
| r | 91824 | 6.9% |
| G | 90277 | 6.8% |
| a | 86798 | 6.5% |
| t | 72957 | 5.5% |
| g | 60276 | 4.5% |
| l | 58288 | 4.4% |
| h | 52448 | 4.0% |
| Other values (59) | 431277 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 892189 | |
| Uppercase Letter | 249071 | 18.8% |
| Space Separator | 125010 | 9.4% |
| Other Punctuation | 25668 | 1.9% |
| Decimal Number | 24634 | 1.9% |
| Close Punctuation | 4362 | 0.3% |
| Open Punctuation | 4362 | 0.3% |
| Dash Punctuation | 538 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 139882 | |
| e | 116797 | |
| r | 91824 | |
| a | 86798 | |
| t | 72957 | |
| g | 60276 | |
| l | 58288 | |
| h | 52448 | 5.9% |
| i | 37064 | 4.2% |
| n | 36365 | 4.1% |
| Other values (15) | 139490 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 90277 | |
| E | 47210 | |
| P | 31285 | 12.6% |
| S | 30510 | 12.2% |
| M | 8765 | 3.5% |
| N | 6259 | 2.5% |
| C | 5716 | 2.3% |
| I | 4591 | 1.8% |
| B | 3743 | 1.5% |
| W | 3524 | 1.4% |
| Other values (13) | 17191 | 6.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9139 | |
| 2 | 5810 | |
| 6 | 3901 | |
| 1 | 2227 | 9.0% |
| 7 | 1431 | 5.8% |
| 9 | 916 | 3.7% |
| 4 | 561 | 2.3% |
| 5 | 509 | 2.1% |
| 3 | 84 | 0.3% |
| 8 | 56 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11873 | |
| , | 6280 | |
| / | 5940 | |
| : | 1370 | 5.3% |
| & | 153 | 0.6% |
| ! | 40 | 0.2% |
| ; | 12 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 125010 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4362 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4362 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 538 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1141260 | |
| Common | 184574 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 139882 | |
| e | 116797 | 10.2% |
| r | 91824 | 8.0% |
| G | 90277 | 7.9% |
| a | 86798 | 7.6% |
| t | 72957 | 6.4% |
| g | 60276 | 5.3% |
| l | 58288 | 5.1% |
| h | 52448 | 4.6% |
| E | 47210 | 4.1% |
| Other values (38) | 324503 |
Common
| Value | Count | Frequency (%) |
| (space) | 125010 | |
| . | 11873 | 6.4% |
| 0 | 9139 | 5.0% |
| , | 6280 | 3.4% |
| / | 5940 | 3.2% |
| 2 | 5810 | 3.1% |
| ) | 4362 | 2.4% |
| ( | 4362 | 2.4% |
| 6 | 3901 | 2.1% |
| 1 | 2227 | 1.2% |
| Other values (11) | 5670 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1325834 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 139882 | 10.6% |
| (space) | 125010 | 9.4% |
| e | 116797 | 8.8% |
| r | 91824 | 6.9% |
| G | 90277 | 6.8% |
| a | 86798 | 6.5% |
| t | 72957 | 5.5% |
| g | 60276 | 4.5% |
| l | 58288 | 4.4% |
| h | 52448 | 4.0% |
| Other values (59) | 431277 |
Missing 
| Distinct | 224 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 328595 |
| Missing (%) | 97.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 83 |
|---|---|
| Median length | 51 |
| Mean length | 18.53163491 |
| Min length | 2 |
Unique
| Unique | 19 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Max error (m): 100 |
|---|---|
| 2nd row | Max error (m): 40 |
| 3rd row | Locality extent = 1.6 |
| 4th row | Locality extent = 1 mile |
| 5th row | Max error (m): 200 |
| Value | Count | Frequency (%) |
| m | 5353 | |
| max | 4966 | |
| error | 4966 | |
| 1990 | 5.4% | |
| locality | 1819 | 4.9% |
| extent | 1818 | 4.9% |
| 100 | 1765 | 4.8% |
| 50 | 914 | 2.5% |
| 200 | 739 | 2.0% |
| 4 | 668 | 1.8% |
| Other values (241) | 11820 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 27319 | |
| r | 16672 | 9.5% |
| e | 10647 | 6.0% |
| o | 10229 | 5.8% |
| a | 10113 | 5.7% |
| t | 9606 | 5.5% |
| 0 | 8337 | 4.7% |
| x | 7001 | 4.0% |
| m | 6536 | 3.7% |
| n | 5377 | 3.1% |
| Other values (53) | 64195 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 94988 | |
| Space Separator | 27319 | 15.5% |
| Decimal Number | 20310 | 11.5% |
| Uppercase Letter | 12876 | 7.3% |
| Other Punctuation | 8461 | 4.8% |
| Open Punctuation | 4969 | 2.8% |
| Close Punctuation | 4969 | 2.8% |
| Math Symbol | 1818 | 1.0% |
| Dash Punctuation | 322 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 16672 | |
| e | 10647 | |
| o | 10229 | |
| a | 10113 | |
| t | 9606 | |
| x | 7001 | |
| m | 6536 | 6.9% |
| n | 5377 | 5.7% |
| i | 4178 | 4.4% |
| l | 2810 | 3.0% |
| Other values (13) | 11819 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4966 | |
| L | 1894 | 14.7% |
| S | 1112 | 8.6% |
| E | 1102 | 8.6% |
| W | 998 | 7.8% |
| G | 774 | 6.0% |
| C | 408 | 3.2% |
| H | 372 | 2.9% |
| V | 253 | 2.0% |
| R | 238 | 1.8% |
| Other values (9) | 759 | 5.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8337 | |
| 1 | 3655 | |
| 5 | 2478 | 12.2% |
| 2 | 1519 | 7.5% |
| 4 | 1397 | 6.9% |
| 8 | 935 | 4.6% |
| 6 | 713 | 3.5% |
| 3 | 496 | 2.4% |
| 7 | 441 | 2.2% |
| 9 | 339 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4966 | |
| . | 1937 | 22.9% |
| ; | 1213 | 14.3% |
| , | 311 | 3.7% |
| / | 31 | 0.4% |
| ' | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27319 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4969 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4969 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1818 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 107864 | |
| Common | 68168 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 16672 | |
| e | 10647 | |
| o | 10229 | |
| a | 10113 | |
| t | 9606 | |
| x | 7001 | 6.5% |
| m | 6536 | 6.1% |
| n | 5377 | 5.0% |
| M | 4966 | 4.6% |
| i | 4178 | 3.9% |
| Other values (32) | 22539 |
Common
| Value | Count | Frequency (%) |
| (space) | 27319 | |
| 0 | 8337 | 12.2% |
| ( | 4969 | 7.3% |
| ) | 4969 | 7.3% |
| : | 4966 | 7.3% |
| 1 | 3655 | 5.4% |
| 5 | 2478 | 3.6% |
| . | 1937 | 2.8% |
| = | 1818 | 2.7% |
| 2 | 1519 | 2.2% |
| Other values (11) | 6201 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 176032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 27319 | |
| r | 16672 | 9.5% |
| e | 10647 | 6.0% |
| o | 10229 | 5.8% |
| a | 10113 | 5.7% |
| t | 9606 | 5.5% |
| 0 | 8337 | 4.7% |
| x | 7001 | 4.0% |
| m | 6536 | 3.7% |
| n | 5377 | 3.1% |
| Other values (53) | 64195 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 134 |
|---|---|
| Median length | 71 |
| Mean length | 83.5 |
| Min length | 58 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Acanthopterygii, Perciformes, Trachinoidei, Champsodontidae |
|---|---|
| 2nd row | Plantae, Dicotyledonae, Gentianales, Rubiaceae, Rubioideae |
| 3rd row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Scincidae, Eugongylinae |
| 4th row | Plantae, Dicotyledonae, Fabales, Fabaceae, Caesalpinioideae |
| Value | Count | Frequency (%) |
| animalia | 2 | 7.1% |
| plantae | 2 | 7.1% |
| chordata | 2 | 7.1% |
| dicotyledonae | 2 | 7.1% |
| vertebrata | 2 | 7.1% |
| actinopterygii | 1 | 3.6% |
| rubioideae | 1 | 3.6% |
| fabaceae | 1 | 3.6% |
| fabales | 1 | 3.6% |
| eugongylinae | 1 | 3.6% |
| Other values (13) | 13 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | 10.5% |
| i | 32 | 9.6% |
| (space) | 24 | 7.2% |
| , | 24 | 7.2% |
| t | 21 | 6.3% |
| n | 16 | 4.8% |
| o | 16 | 4.8% |
| r | 13 | 3.9% |
| l | 11 | 3.3% |
| Other values (25) | 99 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 258 | |
| Uppercase Letter | 28 | 8.4% |
| Space Separator | 24 | 7.2% |
| Other Punctuation | 24 | 7.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | |
| i | 32 | |
| t | 21 | |
| n | 16 | 6.2% |
| o | 16 | 6.2% |
| r | 13 | 5.0% |
| l | 11 | 4.3% |
| c | 11 | 4.3% |
| d | 10 | 3.9% |
| Other values (10) | 50 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| C | 4 | |
| S | 3 | |
| P | 3 | |
| R | 3 | |
| D | 2 | |
| F | 2 | |
| V | 2 | |
| T | 1 | 3.6% |
| G | 1 | 3.6% |
| Other values (3) | 3 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 24 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 286 | |
| Common | 48 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | |
| i | 32 | |
| t | 21 | 7.3% |
| n | 16 | 5.6% |
| o | 16 | 5.6% |
| r | 13 | 4.5% |
| l | 11 | 3.8% |
| c | 11 | 3.8% |
| d | 10 | 3.5% |
| Other values (23) | 78 |
Common
| Value | Count | Frequency (%) |
| (space) | 24 | |
| , | 24 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 334 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | 10.5% |
| i | 32 | 9.6% |
| (space) | 24 | 7.2% |
| , | 24 | 7.2% |
| t | 21 | 6.3% |
| n | 16 | 4.8% |
| o | 16 | 4.8% |
| r | 13 | 3.9% |
| l | 11 | 3.3% |
| Other values (25) | 99 |
earliestEraOrLowestErathem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7.5 |
| Mean length | 7.5 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Plantae |
| 3rd row | Animalia |
| 4th row | Plantae |
| Value | Count | Frequency (%) |
| animalia | 2 | |
| plantae | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| A | 2 | 6.7% |
| m | 2 | 6.7% |
| P | 2 | 6.7% |
| t | 2 | 6.7% |
| e | 2 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26 | |
| Uppercase Letter | 4 | 13.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| m | 2 | 7.7% |
| t | 2 | 7.7% |
| e | 2 | 7.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| A | 2 | 6.7% |
| m | 2 | 6.7% |
| P | 2 | 6.7% |
| t | 2 | 6.7% |
| e | 2 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| A | 2 | 6.7% |
| m | 2 | 6.7% |
| P | 2 | 6.7% |
| t | 2 | 6.7% |
| e | 2 | 6.7% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Tracheophyta |
| 3rd row | Chordata |
| 4th row | Tracheophyta |
| Value | Count | Frequency (%) |
| chordata | 2 | |
| tracheophyta | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| h | 6 | |
| o | 4 | |
| r | 4 | |
| t | 4 | |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| T | 2 | 5.0% |
| c | 2 | 5.0% |
| e | 2 | 5.0% |
| Other values (2) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36 | |
| Uppercase Letter | 4 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| h | 6 | |
| o | 4 | |
| r | 4 | |
| t | 4 | |
| d | 2 | 5.6% |
| c | 2 | 5.6% |
| e | 2 | 5.6% |
| p | 2 | 5.6% |
| y | 2 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| T | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| h | 6 | |
| o | 4 | |
| r | 4 | |
| t | 4 | |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| T | 2 | 5.0% |
| c | 2 | 5.0% |
| e | 2 | 5.0% |
| Other values (2) | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| h | 6 | |
| o | 4 | |
| r | 4 | |
| t | 4 | |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| T | 2 | 5.0% |
| c | 2 | 5.0% |
| e | 2 | 5.0% |
| Other values (2) | 4 |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 338091 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.33333333 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | Magnoliopsida |
|---|---|
| 2nd row | Squamata |
| 3rd row | Magnoliopsida |
| Value | Count | Frequency (%) |
| magnoliopsida | 2 | |
| squamata | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7 | |
| o | 4 | |
| i | 4 | |
| M | 2 | 5.9% |
| g | 2 | 5.9% |
| n | 2 | 5.9% |
| l | 2 | 5.9% |
| p | 2 | 5.9% |
| s | 2 | 5.9% |
| d | 2 | 5.9% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31 | |
| Uppercase Letter | 3 | 8.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7 | |
| o | 4 | |
| i | 4 | |
| g | 2 | 6.5% |
| n | 2 | 6.5% |
| l | 2 | 6.5% |
| p | 2 | 6.5% |
| s | 2 | 6.5% |
| d | 2 | 6.5% |
| q | 1 | 3.2% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7 | |
| o | 4 | |
| i | 4 | |
| M | 2 | 5.9% |
| g | 2 | 5.9% |
| n | 2 | 5.9% |
| l | 2 | 5.9% |
| p | 2 | 5.9% |
| s | 2 | 5.9% |
| d | 2 | 5.9% |
| Other values (5) | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7 | |
| o | 4 | |
| i | 4 | |
| M | 2 | 5.9% |
| g | 2 | 5.9% |
| n | 2 | 5.9% |
| l | 2 | 5.9% |
| p | 2 | 5.9% |
| s | 2 | 5.9% |
| d | 2 | 5.9% |
| Other values (5) | 5 |
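The summary figures for earliestPeriodOrLowestSystem can be reproduced by hand from its three non-missing rows. The sketch below is a plain-Python illustration and not the profiler's actual implementation. The function name `summarize_text_column` and the convention of marking missing cells with `None` are assumptions made for this example. It treats "Distinct (%)" and "Unique (%)" as fractions of the non-missing rows, which is the reading that matches the 66.7% and 33.3% reported above.

```python
from collections import Counter
from statistics import mean, median


def summarize_text_column(values):
    """Recompute per-variable summary figures for one text column.

    `values` is a list of strings; None marks a missing cell (an
    assumption of this sketch). At least one value must be present.
    """
    present = [v for v in values if v is not None]
    counts = Counter(present)
    lengths = [len(v) for v in present]
    n = len(present)
    # "Unique" counts the values that occur exactly once.
    unique = sum(1 for c in counts.values() if c == 1)
    return {
        "distinct": len(counts),
        "distinct_pct": round(100 * len(counts) / n, 1),
        "missing": len(values) - n,
        "unique": unique,
        "unique_pct": round(100 * unique / n, 1),
        "max_length": max(lengths),
        "median_length": median(lengths),
        "mean_length": mean(lengths),
        "min_length": min(lengths),
    }


# The three sample rows shown above, padded with None up to the
# dataset's 338094 observations.
column = ["Magnoliopsida", "Squamata", "Magnoliopsida"] + [None] * 338091
summary = summarize_text_column(column)
print(summary)
```

With these inputs the sketch yields 2 distinct values (66.7%), 1 unique value (33.3%), 338091 missing cells, and lengths of 13 (max), 13 (median), about 11.33 (mean) and 8 (min), in line with the tables for this variable.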
latestPeriodOrHighestSystem
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338091 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 9.666666667 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Perciformes |
|---|---|
| 2nd row | Gentianales |
| 3rd row | Fabales |
| Value | Count | Frequency (%) |
| perciformes | 1 | |
| gentianales | 1 | |
| fabales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5 | |
| a | 4 | |
| s | 3 | |
| r | 2 | 6.9% |
| i | 2 | 6.9% |
| n | 2 | 6.9% |
| l | 2 | 6.9% |
| P | 1 | 3.4% |
| c | 1 | 3.4% |
| f | 1 | 3.4% |
| Other values (6) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26 | |
| Uppercase Letter | 3 | 10.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| a | 4 | |
| s | 3 | |
| r | 2 | 7.7% |
| i | 2 | 7.7% |
| n | 2 | 7.7% |
| l | 2 | 7.7% |
| c | 1 | 3.8% |
| f | 1 | 3.8% |
| o | 1 | 3.8% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| G | 1 | |
| F | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| a | 4 | |
| s | 3 | |
| r | 2 | 6.9% |
| i | 2 | 6.9% |
| n | 2 | 6.9% |
| l | 2 | 6.9% |
| P | 1 | 3.4% |
| c | 1 | 3.4% |
| f | 1 | 3.4% |
| Other values (6) | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5 | |
| a | 4 | |
| s | 3 | |
| r | 2 | 6.9% |
| i | 2 | 6.9% |
| n | 2 | 6.9% |
| l | 2 | 6.9% |
| P | 1 | 3.4% |
| c | 1 | 3.4% |
| f | 1 | 3.4% |
| Other values (6) | 6 |
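The character and category counts for latestPeriodOrHighestSystem can be checked the same way. The sketch below tallies characters and their Unicode general categories with the standard library's `unicodedata.category`. It is an illustration only: the function name `character_profile` and the `CATEGORY_NAMES` lookup are assumptions of this example, and the lookup covers only the categories that appear in this report. Script and block tallies are left out because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# Display names for the two-letter Unicode general category codes
# seen in this report (a partial mapping assumed for this sketch).
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def character_profile(values):
    """Count characters and Unicode general categories over all values."""
    chars = Counter()
    categories = Counter()
    for value in values:
        for ch in value:
            chars[ch] += 1
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the lookup.
            categories[CATEGORY_NAMES.get(code, code)] += 1
    return chars, categories


# The three non-missing rows of latestPeriodOrHighestSystem shown above.
chars, categories = character_profile(["Perciformes", "Gentianales", "Fabales"])
print(chars.most_common(3))
print(dict(categories))
```

For these three values the tallies come to 26 lowercase and 3 uppercase letters, 29 characters in total, with `e`, `a` and `s` occurring 5, 4 and 3 times, matching the tables for this variable.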
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 10.25 |
| Min length | 8 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodontidae |
|---|---|
| 2nd row | Rubiaceae |
| 3rd row | Scincidae |
| 4th row | Fabaceae |
| Value | Count | Frequency (%) |
| champsodontidae | 1 | |
| rubiaceae | 1 | |
| scincidae | 1 | |
| fabaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 7.3% |
| b | 2 | 4.9% |
| o | 2 | 4.9% |
| n | 2 | 4.9% |
| C | 1 | 2.4% |
| R | 1 | 2.4% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37 | |
| Uppercase Letter | 4 | 9.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 8.1% |
| b | 2 | 5.4% |
| o | 2 | 5.4% |
| n | 2 | 5.4% |
| u | 1 | 2.7% |
| t | 1 | 2.7% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| R | 1 | |
| S | 1 | |
| F | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 7.3% |
| b | 2 | 4.9% |
| o | 2 | 4.9% |
| n | 2 | 4.9% |
| C | 1 | 2.4% |
| R | 1 | 2.4% |
| Other values (8) | 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 7.3% |
| b | 2 | 4.9% |
| o | 2 | 4.9% |
| n | 2 | 4.9% |
| C | 1 | 2.4% |
| R | 1 | 2.4% |
| Other values (8) | 8 |
highestBiostratigraphicZone
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10 |
| Min length | 5 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodon |
|---|---|
| 2nd row | Coccocypselum |
| 3rd row | Emoia |
| 4th row | Dimorphandra |
| Value | Count | Frequency (%) |
| champsodon | 1 | |
| coccocypselum | 1 | |
| emoia | 1 | |
| dimorphandra | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36 | |
| Uppercase Letter | 4 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | |
| m | 4 | |
| c | 3 | |
| p | 3 | |
| n | 2 | 5.6% |
| i | 2 | 5.6% |
| h | 2 | 5.6% |
| d | 2 | 5.6% |
| s | 2 | 5.6% |
| Other values (5) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 1 | |
| D | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10 |
| Min length | 5 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodon |
|---|---|
| 2nd row | Coccocypselum |
| 3rd row | Emoia |
| 4th row | Dimorphandra |
| Value | Count | Frequency (%) |
| champsodon | 1 | |
| coccocypselum | 1 | |
| emoia | 1 | |
| dimorphandra | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36 | |
| Uppercase Letter | 4 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | |
| m | 4 | |
| c | 3 | |
| p | 3 | |
| n | 2 | 5.6% |
| i | 2 | 5.6% |
| h | 2 | 5.6% |
| d | 2 | 5.6% |
| s | 2 | 5.6% |
| Other values (5) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 1 | |
| D | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
member
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338091 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10.66666667 |
| Min length | 9 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | nudivittis |
|---|---|
| 2nd row | guianense |
| 3rd row | caeruleocauda |
| Value | Count | Frequency (%) |
| nudivittis | 1 | |
| guianense | 1 | |
| caeruleocauda | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| d | 2 | |
| t | 2 | |
| s | 2 | |
| c | 2 | |
| v | 1 | 3.1% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| d | 2 | |
| t | 2 | |
| s | 2 | |
| c | 2 | |
| v | 1 | 3.1% |
| Other values (4) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| d | 2 | |
| t | 2 | |
| s | 2 | |
| c | 2 | |
| v | 1 | 3.1% |
| Other values (4) | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| d | 2 | |
| t | 2 | |
| s | 2 | |
| c | 2 | |
| v | 1 | 3.1% |
| Other values (4) | 4 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.5 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 25.0% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | GENUS |
| Value | Count | Frequency (%) |
| species | 3 | |
| genus | 1 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 7 | |
| E | 7 | |
| P | 3 | |
| C | 3 | |
| I | 3 | |
| G | 1 | 3.8% |
| N | 1 | 3.8% |
| U | 1 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 26 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 7 | |
| E | 7 | |
| P | 3 | |
| C | 3 | |
| I | 3 | |
| G | 1 | 3.8% |
| N | 1 | 3.8% |
| U | 1 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 7 | |
| E | 7 | |
| P | 3 | |
| C | 3 | |
| I | 3 | |
| G | 1 | 3.8% |
| N | 1 | 3.8% |
| U | 1 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 7 | |
| E | 7 | |
| P | 3 | |
| C | 3 | |
| I | 3 | |
| G | 1 | 3.8% |
| N | 1 | 3.8% |
| U | 1 | 3.8% |
Missing 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 333028 |
| Missing (%) | 98.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 5.255823135 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | aff. |
|---|---|
| 2nd row | cf. |
| 3rd row | aff. |
| 4th row | uncertain |
| 5th row | uncertain |
| Value | Count | Frequency (%) |
| cf | 2738 | |
| uncertain | 1858 | |
| aff | 320 | 6.3% |
| near | 75 | 1.5% |
| complex | 38 | 0.7% |
| sp | 16 | 0.3% |
| group | 12 | 0.2% |
| n | 10 | 0.2% |
| nov | 6 | 0.1% |
| s.l | 5 | 0.1% |
| Other values (2) | 9 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 4628 | |
| n | 3807 | |
| f | 3378 | |
| . | 2728 | |
| a | 2237 | |
| e | 1976 | |
| r | 1945 | |
| t | 1858 | |
| i | 1858 | |
| u | 1842 | 6.9% |
| Other values (12) | 369 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23827 | |
| Other Punctuation | 2728 | 10.2% |
| Uppercase Letter | 50 | 0.2% |
| Space Separator | 21 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4628 | |
| n | 3807 | |
| f | 3378 | |
| a | 2237 | |
| e | 1976 | |
| r | 1945 | |
| t | 1858 | |
| i | 1858 | |
| u | 1842 | 7.7% |
| p | 66 | 0.3% |
| Other values (7) | 232 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 28 | |
| A | 16 | |
| C | 6 | 12.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2728 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23877 | |
| Common | 2749 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 4628 | |
| n | 3807 | |
| f | 3378 | |
| a | 2237 | |
| e | 1976 | |
| r | 1945 | |
| t | 1858 | |
| i | 1858 | |
| u | 1842 | 7.7% |
| p | 66 | 0.3% |
| Other values (10) | 282 | 1.2% |
Common
| Value | Count | Frequency (%) |
| . | 2728 | |
| (space) | 21 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26626 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 4628 | |
| n | 3807 | |
| f | 3378 | |
| . | 2728 | |
| a | 2237 | |
| e | 1976 | |
| r | 1945 | |
| t | 1858 | |
| i | 1858 | |
| u | 1842 | 6.9% |
| Other values (12) | 369 | 1.4% |
typeStatus
Text
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 331537 |
| Missing (%) | 98.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.010370596 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PARATYPE |
|---|---|
| 2nd row | PARATYPE |
| 3rd row | PARATYPE |
| 4th row | PARATYPE |
| 5th row | PARATYPE |
| Value | Count | Frequency (%) |
| paratype | 5817 | |
| holotype | 330 | 5.0% |
| paralectotype | 125 | 1.9% |
| cotype | 86 | 1.3% |
| syntype | 76 | 1.2% |
| type | 73 | 1.1% |
| allotype | 23 | 0.4% |
| neotype | 13 | 0.2% |
| topotype | 10 | 0.2% |
| isotype | 4 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 12509 | |
| A | 11907 | |
| E | 6695 | |
| T | 6692 | |
| Y | 6633 | |
| R | 5942 | |
| O | 931 | 1.8% |
| L | 501 | 1.0% |
| H | 330 | 0.6% |
| C | 211 | 0.4% |
| Other values (3) | 173 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 52524 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 12509 | |
| A | 11907 | |
| E | 6695 | |
| T | 6692 | |
| Y | 6633 | |
| R | 5942 | |
| O | 931 | 1.8% |
| L | 501 | 1.0% |
| H | 330 | 0.6% |
| C | 211 | 0.4% |
| Other values (3) | 173 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 52524 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 12509 | |
| A | 11907 | |
| E | 6695 | |
| T | 6692 | |
| Y | 6633 | |
| R | 5942 | |
| O | 931 | 1.8% |
| L | 501 | 1.0% |
| H | 330 | 0.6% |
| C | 211 | 0.4% |
| Other values (3) | 173 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52524 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 12509 | |
| A | 11907 | |
| E | 6695 | |
| T | 6692 | |
| Y | 6633 | |
| R | 5942 | |
| O | 931 | 1.8% |
| L | 501 | 1.0% |
| H | 330 | 0.6% |
| C | 211 | 0.4% |
| Other values (3) | 173 | 0.3% |
identifiedBy
Text
Missing 
| Distinct | 1866 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 226045 |
| Missing (%) | 66.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 150 |
|---|---|
| Median length | 128 |
| Mean length | 39.12826531 |
| Min length | 2 |
Unique
| Unique | 200 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Anker, Arthur |
|---|---|
| 2nd row | Osborn, Karen J., (IZ), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 3rd row | Baldwin, Carole C. |
| 4th row | Hobbs, Horton H., Jr., Smithsonian Institution, National Museum of Natural History |
| 5th row | Paulay, Gustav, University of Florida (UNITED STATES) |
| Value | Count | Frequency (%) |
| united | 36040 | 5.8% |
| states | 35997 | 5.8% |
| of | 27839 | 4.5% |
| smithsonian | 24345 | 3.9% |
| | 22476 | 3.6% |
| institution | 20514 | 3.3% |
| national | 18658 | 3.0% |
| museum | 17521 | 2.8% |
| natural | 17241 | 2.8% |
| history | 17162 | 2.8% |
| Other values (2280) | 384193 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 509937 | 11.6% |
| i | 263018 | 6.0% |
| a | 259974 | 5.9% |
| t | 236702 | 5.4% |
| n | 236587 | 5.4% |
| o | 217618 | 5.0% |
| e | 199155 | 4.5% |
| , | 179169 | 4.1% |
| r | 173333 | 4.0% |
| s | 169593 | 3.9% |
| Other values (73) | 1939197 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2458529 | |
| Uppercase Letter | 993786 | |
| Space Separator | 509937 | 11.6% |
| Other Punctuation | 274091 | 6.3% |
| Close Punctuation | 61757 | 1.4% |
| Open Punctuation | 61757 | 1.4% |
| Dash Punctuation | 23898 | 0.5% |
| Decimal Number | 518 | < 0.1% |
| Initial Punctuation | 5 | < 0.1% |
| Final Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 263018 | |
| a | 259974 | |
| t | 236702 | |
| n | 236587 | |
| o | 217618 | |
| e | 199155 | |
| r | 173333 | 7.1% |
| s | 169593 | 6.9% |
| l | 137018 | 5.6% |
| u | 122655 | 5.0% |
| Other values (27) | 442876 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 131779 | |
| S | 126036 | |
| E | 89193 | 9.0% |
| N | 81500 | 8.2% |
| I | 69915 | 7.0% |
| A | 68248 | 6.9% |
| D | 60242 | 6.1% |
| U | 50332 | 5.1% |
| M | 44673 | 4.5% |
| B | 34085 | 3.4% |
| Other values (18) | 237783 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 179169 | |
| . | 89197 | |
| ; | 4445 | 1.6% |
| ' | 576 | 0.2% |
| & | 430 | 0.2% |
| / | 274 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 148 | |
| 4 | 74 | |
| 6 | 74 | |
| 0 | 74 | |
| 1 | 74 | |
| 9 | 74 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 509937 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 61757 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 61757 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23898 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 5 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3452315 | |
| Common | 931968 | 21.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 263018 | 7.6% |
| a | 259974 | 7.5% |
| t | 236702 | 6.9% |
| n | 236587 | 6.9% |
| o | 217618 | 6.3% |
| e | 199155 | 5.8% |
| r | 173333 | 5.0% |
| s | 169593 | 4.9% |
| l | 137018 | 4.0% |
| T | 131779 | 3.8% |
| Other values (55) | 1427538 |
Common
| Value | Count | Frequency (%) |
| (space) | 509937 | |
| , | 179169 | 19.2% |
| . | 89197 | 9.6% |
| ) | 61757 | 6.6% |
| ( | 61757 | 6.6% |
| - | 23898 | 2.6% |
| ; | 4445 | 0.5% |
| ' | 576 | 0.1% |
| & | 430 | < 0.1% |
| / | 274 | < 0.1% |
| Other values (8) | 528 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4383750 | |
| None | 523 | < 0.1% |
| Punctuation | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 509937 | 11.6% |
| i | 263018 | 6.0% |
| a | 259974 | 5.9% |
| t | 236702 | 5.4% |
| n | 236587 | 5.4% |
| o | 217618 | 5.0% |
| e | 199155 | 4.5% |
| , | 179169 | 4.1% |
| r | 173333 | 4.0% |
| s | 169593 | 3.9% |
| Other values (58) | 1938664 |
None
| Value | Count | Frequency (%) |
| í | 212 | |
| ö | 128 | |
| á | 99 | |
| ø | 29 | 5.5% |
| ú | 26 | 5.0% |
| ó | 12 | 2.3% |
| Ø | 7 | 1.3% |
| ë | 3 | 0.6% |
| è | 3 | 0.6% |
| ñ | 1 | 0.2% |
| Other values (3) | 3 | 0.6% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 5 | |
| ” | 5 |
identifiedByID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 8 | |
| E | 8 | |
| A | 4 | |
| P | 4 | |
| T | 4 | |
| D | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 32 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 8 | |
| E | 8 | |
| A | 4 | |
| P | 4 | |
| T | 4 | |
| D | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 8 | |
| E | 8 | |
| A | 4 | |
| P | 4 | |
| T | 4 | |
| D | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 8 | |
| E | 8 | |
| A | 4 | |
| P | 4 | |
| T | 4 | |
| D | 4 |
identificationVerificationStatus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
|---|---|
| 2nd row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
| 3rd row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
| 4th row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
| Value | Count | Frequency (%) |
| 26098c25-8f7f-4c71-97ac-1d3db181c65e | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 16 | |
| c | 16 | |
| - | 16 | |
| 8 | 12 | 8.3% |
| 7 | 12 | 8.3% |
| 2 | 8 | 5.6% |
| 6 | 8 | 5.6% |
| d | 8 | 5.6% |
| f | 8 | 5.6% |
| 5 | 8 | 5.6% |
| Other values (7) | 32 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 84 | |
| Lowercase Letter | 44 | |
| Dash Punctuation | 16 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 16 | |
| 8 | 12 | |
| 7 | 12 | |
| 2 | 8 | |
| 6 | 8 | |
| 5 | 8 | |
| 9 | 8 | |
| 4 | 4 | 4.8% |
| 0 | 4 | 4.8% |
| 3 | 4 | 4.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 16 | |
| d | 8 | |
| f | 8 | |
| a | 4 | 9.1% |
| b | 4 | 9.1% |
| e | 4 | 9.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 100 | |
| Latin | 44 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 16 | |
| - | 16 | |
| 8 | 12 | |
| 7 | 12 | |
| 2 | 8 | |
| 6 | 8 | |
| 5 | 8 | |
| 9 | 8 | |
| 4 | 4 | 4.0% |
| 0 | 4 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| c | 16 | |
| d | 8 | |
| f | 8 | |
| a | 4 | 9.1% |
| b | 4 | 9.1% |
| e | 4 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 16 | |
| c | 16 | |
| - | 16 | |
| 8 | 12 | 8.3% |
| 7 | 12 | 8.3% |
| 2 | 8 | 5.6% |
| 6 | 8 | 5.6% |
| d | 8 | 5.6% |
| f | 8 | 5.6% |
| 5 | 8 | 5.6% |
| Other values (7) | 32 |
identificationRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| Value | Count | Frequency (%) |
| us | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 4 | |
| S | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4 | |
| S | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 4 | |
| S | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 4 | |
| S | 4 |
taxonID
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-12-01T12:07:17.508Z |
|---|---|
| 2nd row | 2024-12-01T12:07:28.759Z |
| 3rd row | 2024-12-01T12:07:38.231Z |
| 4th row | 2024-12-01T12:07:36.611Z |
| Value | Count | Frequency (%) |
| 2024-12-01t12:07:17.508z | 1 | |
| 2024-12-01t12:07:28.759z | 1 | |
| 2024-12-01t12:07:38.231z | 1 | |
| 2024-12-01t12:07:36.611z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 18 | |
| 1 | 16 | |
| 0 | 13 | |
| - | 8 | |
| : | 8 | |
| 7 | 6 | 6.2% |
| 4 | 4 | 4.2% |
| T | 4 | 4.2% |
| . | 4 | 4.2% |
| Z | 4 | 4.2% |
| Other values (5) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68 | |
| Other Punctuation | 12 | 12.5% |
| Dash Punctuation | 8 | 8.3% |
| Uppercase Letter | 8 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 18 | |
| 1 | 16 | |
| 0 | 13 | |
| 7 | 6 | 8.8% |
| 4 | 4 | 5.9% |
| 8 | 3 | 4.4% |
| 3 | 3 | 4.4% |
| 5 | 2 | 2.9% |
| 6 | 2 | 2.9% |
| 9 | 1 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 8 | |
| . | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 4 | |
| Z | 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 88 | |
| Latin | 8 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 18 | |
| 1 | 16 | |
| 0 | 13 | |
| - | 8 | |
| : | 8 | |
| 7 | 6 | 6.8% |
| 4 | 4 | 4.5% |
| . | 4 | 4.5% |
| 8 | 3 | 3.4% |
| 3 | 3 | 3.4% |
| Other values (3) | 5 | 5.7% |
Latin
| Value | Count | Frequency (%) |
| T | 4 | |
| Z | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 96 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 18 | |
| 1 | 16 | |
| 0 | 13 | |
| - | 8 | |
| : | 8 | |
| 7 | 6 | 6.2% |
| 4 | 4 | 4.2% |
| T | 4 | 4.2% |
| . | 4 | 4.2% |
| Z | 4 | 4.2% |
| Other values (5) | 11 |
scientificNameID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338092 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 940.5 |
|---|---|
| 2nd row | 651.0 |
| Value | Count | Frequency (%) |
| 940.5 | 1 | |
| 651.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 2 | |
| 5 | 2 | |
| 9 | 1 | |
| 4 | 1 | |
| 6 | 1 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 | |
| Other Punctuation | 2 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 2 | |
| 9 | 1 | |
| 4 | 1 | |
| 6 | 1 | |
| 1 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 2 | |
| 5 | 2 | |
| 9 | 1 | |
| 4 | 1 | |
| 6 | 1 | |
| 1 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 2 | |
| 5 | 2 | |
| 9 | 1 | |
| 4 | 1 | |
| 6 | 1 | |
| 1 | 1 |
Missing 
| Distinct | 44952 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 6111 |
| Missing (%) | 1.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.770428004 |
| Min length | 1 |
Unique
| Unique | 9418 |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | 10583418 |
|---|---|
| 2nd row | 5854277 |
| 3rd row | 5771 |
| 4th row | 4479 |
| 5th row | 2651085 |
| Value | Count | Frequency (%) |
| 6841 | 3252 | 1.0% |
| 637 | 2297 | 0.7% |
| 2285664 | 2008 | 0.6% |
| 2329589 | 2006 | 0.6% |
| 2440447 | 1919 | 0.6% |
| 8324617 | 1660 | 0.5% |
| 8770992 | 1474 | 0.4% |
| 2431491 | 1334 | 0.4% |
| 2307333 | 1035 | 0.3% |
| 68 | 875 | 0.3% |
| Other values (44942) | 314123 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 335749 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207520 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2247666 | |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 335749 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207520 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2247667 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 335749 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207520 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2247667 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 335749 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207520 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 104 |
|---|---|
| Median length | 92.5 |
| Mean length | 60.25 |
| Min length | 28 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | GEODETIC_DATUM_ASSUMED_WGS84;GEODETIC_DATUM_INVALID;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
|---|---|
| 2nd row | GEODETIC_DATUM_ASSUMED_WGS84 |
| 3rd row | GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
| 4th row | GEODETIC_DATUM_ASSUMED_WGS84 |
| Value | Count | Frequency (%) |
| geodetic_datum_assumed_wgs84 | 2 | |
| geodetic_datum_assumed_wgs84;geodetic_datum_invalid;continent_derived_from_coordinates;continent_invalid | 1 | |
| geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 24 | 10.0% |
| D | 23 | 9.5% |
| _ | 22 | 9.1% |
| T | 20 | 8.3% |
| I | 19 | 7.9% |
| N | 17 | 7.1% |
| O | 15 | 6.2% |
| S | 14 | 5.8% |
| A | 14 | 5.8% |
| M | 11 | 4.6% |
| Other values (11) | 62 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 206 | |
| Connector Punctuation | 22 | 9.1% |
| Decimal Number | 8 | 3.3% |
| Other Punctuation | 5 | 2.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 24 | |
| D | 23 | |
| T | 20 | |
| I | 19 | |
| N | 17 | |
| O | 15 | 7.3% |
| S | 14 | 6.8% |
| A | 14 | 6.8% |
| M | 11 | 5.3% |
| C | 11 | 5.3% |
| Other values (7) | 38 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 4 | |
| 4 | 4 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 22 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 206 | |
| Common | 35 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 24 | |
| D | 23 | |
| T | 20 | |
| I | 19 | |
| N | 17 | |
| O | 15 | 7.3% |
| S | 14 | 6.8% |
| A | 14 | 6.8% |
| M | 11 | 5.3% |
| C | 11 | 5.3% |
| Other values (7) | 38 |
Common
| Value | Count | Frequency (%) |
| _ | 22 | |
| ; | 5 | 14.3% |
| 8 | 4 | 11.4% |
| 4 | 4 | 11.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 24 | 10.0% |
| D | 23 | 9.5% |
| _ | 22 | 9.1% |
| T | 20 | 8.3% |
| I | 19 | 7.9% |
| N | 17 | 7.1% |
| O | 15 | 6.2% |
| S | 14 | 5.8% |
| A | 14 | 5.8% |
| M | 11 | 4.6% |
| Other values (11) | 62 |
scientificName
Text
| Distinct | 45747 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 136 |
|---|---|
| Median length | 95 |
| Mean length | 30.8157785 |
| Min length | 4 |
Unique
| Unique | 9824 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | Rectiostoma fernaldella |
|---|---|
| 2nd row | Siboglinidae |
| 3rd row | Amphinomidae |
| 4th row | Cambaridae |
| 5th row | Polystichum Roth |
| Value | Count | Frequency (%) |
| & | 37261 | 3.0% |
| linnaeus | 10150 | 0.8% |
| 1758 | 7990 | 0.6% |
| l | 6159 | 0.5% |
| sedis | 6107 | 0.5% |
| incertae | 6107 | 0.5% |
| 1985 | 5118 | 0.4% |
| plethodon | 4673 | 0.4% |
| orconectes | 4548 | 0.4% |
| walker | 4503 | 0.4% |
| Other values (49822) | 1170241 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 924764 | 8.9% |
| a | 838089 | 8.0% |
| e | 687702 | 6.6% |
| i | 638989 | 6.1% |
| r | 539087 | 5.2% |
| s | 536232 | 5.1% |
| o | 513067 | 4.9% |
| n | 469039 | 4.5% |
| l | 423628 | 4.1% |
| t | 373559 | 3.6% |
| Other values (97) | 4474443 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7079014 | |
| Decimal Number | 1044964 | 10.0% |
| Space Separator | 924764 | 8.9% |
| Uppercase Letter | 742652 | 7.1% |
| Other Punctuation | 379610 | 3.6% |
| Close Punctuation | 121557 | 1.2% |
| Open Punctuation | 121557 | 1.2% |
| Dash Punctuation | 4400 | < 0.1% |
| Math Symbol | 78 | < 0.1% |
| Connector Punctuation | 3 | < 0.1% |
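The category labels in the table above correspond to Unicode general categories (for example, `Ll` is Lowercase Letter and `Nd` is Decimal Number). A minimal sketch of how such counts could be reproduced with Python's standard `unicodedata` module, assuming the column values are available as plain strings. The helper name `category_counts` and the label mapping are illustrative and not taken from the report; the two sample strings are the 1st and 5th sample rows of scientificName.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report
# (illustrative subset; unknown codes fall back to the two-letter code).
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Two sample rows from the scientificName column.
sample = ["Rectiostoma fernaldella", "Polystichum Roth"]
for name, n in category_counts(sample).most_common():
    print(name, n)
```

Dividing each count by the total number of characters would give the Frequency (%) column.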
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 838089 | |
| e | 687702 | |
| i | 638989 | 9.0% |
| r | 539087 | 7.6% |
| s | 536232 | 7.6% |
| o | 513067 | 7.2% |
| n | 469039 | 6.6% |
| l | 423628 | 6.0% |
| t | 373559 | 5.3% |
| u | 350709 | 5.0% |
| Other values (43) | 1708913 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 65610 | 8.8% |
| C | 64603 | 8.7% |
| B | 58305 | 7.9% |
| S | 57140 | 7.7% |
| L | 52829 | 7.1% |
| M | 49682 | 6.7% |
| H | 47964 | 6.5% |
| A | 47366 | 6.4% |
| G | 43295 | 5.8% |
| D | 38406 | 5.2% |
| Other values (24) | 217452 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 307079 | |
| 8 | 201736 | |
| 9 | 143440 | |
| 7 | 71902 | 6.9% |
| 2 | 62368 | 6.0% |
| 0 | 62111 | 5.9% |
| 5 | 60348 | 5.8% |
| 6 | 50547 | 4.8% |
| 3 | 46539 | 4.5% |
| 4 | 38894 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 266361 | |
| . | 75661 | 19.9% |
| & | 37261 | 9.8% |
| ' | 327 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 924764 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 121557 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 121557 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4400 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 78 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7821666 | |
| Common | 2596933 | 24.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 838089 | 10.7% |
| e | 687702 | 8.8% |
| i | 638989 | 8.2% |
| r | 539087 | 6.9% |
| s | 536232 | 6.9% |
| o | 513067 | 6.6% |
| n | 469039 | 6.0% |
| l | 423628 | 5.4% |
| t | 373559 | 4.8% |
| u | 350709 | 4.5% |
| Other values (77) | 2451565 |
Common
| Value | Count | Frequency (%) |
| (space) | 924764 | |
| 1 | 307079 | 11.8% |
| , | 266361 | 10.3% |
| 8 | 201736 | 7.8% |
| 9 | 143440 | 5.5% |
| ) | 121557 | 4.7% |
| ( | 121557 | 4.7% |
| . | 75661 | 2.9% |
| 7 | 71902 | 2.8% |
| 2 | 62368 | 2.4% |
| Other values (10) | 300508 | 11.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10395423 | |
| None | 23176 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 924764 | 8.9% |
| a | 838089 | 8.1% |
| e | 687702 | 6.6% |
| i | 638989 | 6.1% |
| r | 539087 | 5.2% |
| s | 536232 | 5.2% |
| o | 513067 | 4.9% |
| n | 469039 | 4.5% |
| l | 423628 | 4.1% |
| t | 373559 | 3.6% |
| Other values (61) | 4451267 |
None
| Value | Count | Frequency (%) |
| ü | 7909 | |
| é | 6134 | |
| è | 2353 | 10.2% |
| ö | 1884 | 8.1% |
| å | 1557 | 6.7% |
| ä | 858 | 3.7% |
| ó | 719 | 3.1% |
| á | 407 | 1.8% |
| ø | 314 | 1.4% |
| É | 261 | 1.1% |
| Other values (26) | 780 | 3.4% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| Value | Count | Frequency (%) |
| false | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 4 | |
| a | 4 | |
| l | 4 | |
| s | 4 | |
| e | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 4 | |
| a | 4 | |
| l | 4 | |
| s | 4 | |
| e | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 4 | |
| a | 4 | |
| l | 4 | |
| s | 4 | |
| e | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 4 | |
| a | 4 | |
| l | 4 | |
| s | 4 | |
| e | 4 |
parentNameUsage
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2387143 |
|---|---|
| 2nd row | 2906907 |
| 3rd row | 2463461 |
| 4th row | 2974262 |
| Value | Count | Frequency (%) |
| 2387143 | 1 | |
| 2906907 | 1 | |
| 2463461 | 1 | |
| 2974262 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2387143 |
|---|---|
| 2nd row | 2906907 |
| 3rd row | 2463461 |
| 4th row | 2974262 |
| Value | Count | Frequency (%) |
| 2387143 | 1 | |
| 2906907 | 1 | |
| 2463461 | 1 | |
| 2974262 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 4 | 4 | |
| 6 | 4 | |
| 3 | 3 | |
| 7 | 3 | |
| 9 | 3 | |
| 1 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 8 | 1 | 3.6% |
nameAccordingTo
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 6 |
| 3rd row | 1 |
| 4th row | 6 |
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 6 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 6 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 6 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 6 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 6 | 2 |
namePublishedIn
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4.5 |
| Mean length | 4.5 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 7707728 |
| 3rd row | 44 |
| 4th row | 7707728 |
| Value | Count | Frequency (%) |
| 44 | 2 | |
| 7707728 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 4 | 4 | |
| 0 | 2 | 11.1% |
| 2 | 2 | 11.1% |
| 8 | 2 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 4 | 4 | |
| 0 | 2 | 11.1% |
| 2 | 2 | 11.1% |
| 8 | 2 | 11.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 4 | 4 | |
| 0 | 2 | 11.1% |
| 2 | 2 | 11.1% |
| 8 | 2 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 4 | 4 | |
| 0 | 2 | 11.1% |
| 2 | 2 | 11.1% |
| 8 | 2 | 11.1% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 338091 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 4.666666667 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | 220 |
|---|---|
| 2nd row | 11592253 |
| 3rd row | 220 |
| Value | Count | Frequency (%) |
| 220 | 2 | |
| 11592253 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 0 | 2 | 14.3% |
| 1 | 2 | 14.3% |
| 5 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 0 | 2 | 14.3% |
| 1 | 2 | 14.3% |
| 5 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 0 | 2 | 14.3% |
| 1 | 2 | 14.3% |
| 5 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 0 | 2 | 14.3% |
| 1 | 2 | 14.3% |
| 5 | 2 | 14.3% |
| 9 | 1 | 7.1% |
| 3 | 1 | 7.1% |
Missing 
| Distinct | 4818 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 5891 |
| Missing (%) | 1.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 162 |
|---|---|
| Median length | 142 |
| Mean length | 76.55919724 |
| Min length | 3 |
Unique
| Unique | 462 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia, Arthropoda, Insecta, Lepidoptera, Depressariidae, Stenomatinae |
|---|---|
| 2nd row | Animalia, Annelida, Polychaeta, Sedentaria, Canalipalpata, Sabellida, Siboglinidae |
| 3rd row | Animalia, Annelida, Polychaeta, Errantia, Amphinomida, Amphinomidae |
| 4th row | Animalia, Arthropoda, Crustacea, Malacostraca, Eumalacostraca, Eucarida, Decapoda, Pleocyemata, Cambaridae |
| 5th row | Plantae, Pteridophyte, Polypodiales, Dryopteridaceae |
| Value | Count | Frequency (%) |
| animalia | 287414 | 13.0% |
| arthropoda | 145732 | 6.6% |
| insecta | 113112 | 5.1% |
| chordata | 103438 | 4.7% |
| vertebrata | 102398 | 4.6% |
| lepidoptera | 79682 | 3.6% |
| actinopterygii | 40707 | 1.8% |
| osteichthyes | 40705 | 1.8% |
| neopterygii | 40702 | 1.8% |
| plantae | 35513 | 1.6% |
| Other values (5331) | 1219943 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3309102 | |
| i | 2169092 | 8.5% |
| e | 2151600 | 8.5% |
| (space) | 1877143 | 7.4% |
| , | 1874867 | 7.4% |
| t | 1538441 | 6.0% |
| r | 1525223 | 6.0% |
| o | 1481338 | 5.8% |
| n | 1000711 | 3.9% |
| d | 933466 | 3.7% |
| Other values (63) | 7572212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19468705 | |
| Uppercase Letter | 2207029 | 8.7% |
| Other Punctuation | 1878709 | 7.4% |
| Space Separator | 1877143 | 7.4% |
| Open Punctuation | 715 | < 0.1% |
| Close Punctuation | 715 | < 0.1% |
| Dash Punctuation | 127 | < 0.1% |
| Decimal Number | 40 | < 0.1% |
| Connector Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3309102 | |
| i | 2169092 | |
| e | 2151600 | |
| t | 1538441 | |
| r | 1525223 | |
| o | 1481338 | |
| n | 1000711 | 5.1% |
| d | 933466 | 4.8% |
| l | 864491 | 4.4% |
| c | 812570 | 4.2% |
| Other values (17) | 3682671 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 616321 | |
| C | 270051 | |
| P | 208229 | 9.4% |
| M | 124908 | 5.7% |
| I | 120801 | 5.5% |
| E | 116755 | 5.3% |
| L | 112347 | 5.1% |
| V | 112031 | 5.1% |
| S | 86356 | 3.9% |
| D | 72265 | 3.3% |
| Other values (16) | 366965 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 9 | |
| 1 | 8 | |
| 0 | 7 | |
| 3 | 7 | |
| 9 | 3 | 7.5% |
| 7 | 2 | 5.0% |
| 4 | 1 | 2.5% |
| 2 | 1 | 2.5% |
| 5 | 1 | 2.5% |
| 8 | 1 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1874867 | |
| . | 3842 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 679 | |
| [ | 36 | 5.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 679 | |
| ] | 36 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| – | 124 | |
| - | 3 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1877143 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21675734 | |
| Common | 3757461 | 14.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3309102 | |
| i | 2169092 | 10.0% |
| e | 2151600 | 9.9% |
| t | 1538441 | 7.1% |
| r | 1525223 | 7.0% |
| o | 1481338 | 6.8% |
| n | 1000711 | 4.6% |
| d | 933466 | 4.3% |
| l | 864491 | 4.0% |
| c | 812570 | 3.7% |
| Other values (43) | 5889700 |
Common
| Value | Count | Frequency (%) |
| (space) | 1877143 | |
| , | 1874867 | |
| . | 3842 | 0.1% |
| ( | 679 | < 0.1% |
| ) | 679 | < 0.1% |
| – | 124 | < 0.1% |
| [ | 36 | < 0.1% |
| ] | 36 | < 0.1% |
| _ | 12 | < 0.1% |
| 6 | 9 | < 0.1% |
| Other values (10) | 34 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25433053 | |
| Punctuation | 124 | < 0.1% |
| None | 18 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3309102 | |
| i | 2169092 | 8.5% |
| e | 2151600 | 8.5% |
| (space) | 1877143 | 7.4% |
| , | 1874867 | 7.4% |
| t | 1538441 | 6.0% |
| r | 1525223 | 6.0% |
| o | 1481338 | 5.8% |
| n | 1000711 | 3.9% |
| d | 933466 | 3.7% |
| Other values (61) | 7572070 |
Punctuation
| Value | Count | Frequency (%) |
| – | 124 |
None
| Value | Count | Frequency (%) |
| ö | 18 |
kingdom
Text
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 8.009370203 |
| Min length | 4 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| animalia | 291926 | |
| plantae | 35530 | 10.3% |
| incertae | 6107 | 1.8% |
| sedis | 6107 | 1.8% |
| chromista | 3038 | 0.9% |
| bacteria | 1166 | 0.3% |
| fungi | 322 | 0.1% |
| 8518 | 1 | < 0.1% |
| 8798 | 1 | < 0.1% |
| 9115 | 1 | < 0.1% |
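The value table above lists three purely numeric entries (8518, 8798, 9115) alongside the kingdom names, which suggests that a few rows carry values belonging to another column. A minimal sketch of how such entries could be flagged, assuming the column is available as a list of strings with `None` for missing cells. The function name `non_name_values` and the digits-only rule are illustrative assumptions, not the method used by the report.

```python
def non_name_values(values):
    """Return entries made up only of digits, which cannot be taxon names."""
    return [v for v in values if v is not None and v.strip().isdigit()]


# Hypothetical excerpt of the kingdom column, including one missing cell.
kingdoms = ["Animalia", "Plantae", "incertae sedis", "8518", "Chromista", None]
print(non_name_values(kingdoms))
```

A stricter check could compare each value against an explicit list of accepted kingdom names instead of only rejecting digits.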
Most occurring characters
| Value | Count | Frequency (%) |
| a | 666389 | |
| i | 600592 | |
| n | 333885 | |
| l | 327456 | |
| m | 294964 | |
| A | 291926 | |
| e | 55017 | 2.0% |
| t | 45841 | 1.7% |
| P | 35530 | 1.3% |
| s | 15252 | 0.6% |
| Other values (18) | 41060 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2369807 | |
| Uppercase Letter | 331982 | 12.3% |
| Space Separator | 6107 | 0.2% |
| Decimal Number | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 666389 | |
| i | 600592 | |
| n | 333885 | |
| l | 327456 | |
| m | 294964 | |
| e | 55017 | 2.3% |
| t | 45841 | 1.9% |
| s | 15252 | 0.6% |
| r | 10311 | 0.4% |
| c | 7273 | 0.3% |
| Other values (5) | 12827 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 5 | |
| 5 | 3 | |
| 1 | 3 | |
| 9 | 2 | 12.5% |
| 7 | 1 | 6.2% |
| 3 | 1 | 6.2% |
| 6 | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 291926 | |
| P | 35530 | 10.7% |
| C | 3038 | 0.9% |
| B | 1166 | 0.4% |
| F | 322 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6107 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2701789 | |
| Common | 6123 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 666389 | |
| i | 600592 | |
| n | 333885 | |
| l | 327456 | |
| m | 294964 | |
| A | 291926 | |
| e | 55017 | 2.0% |
| t | 45841 | 1.7% |
| P | 35530 | 1.3% |
| s | 15252 | 0.6% |
| Other values (10) | 34937 | 1.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 6107 | |
| 8 | 5 | 0.1% |
| 5 | 3 | < 0.1% |
| 1 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2707912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 666389 | |
| i | 600592 | |
| n | 333885 | |
| l | 327456 | |
| m | 294964 | |
| A | 291926 | |
| e | 55017 | 2.0% |
| t | 45841 | 1.7% |
| P | 35530 | 1.3% |
| s | 15252 | 0.6% |
| Other values (18) | 41060 | 1.5% |
phylum
Text
Missing 
| Distinct | 44 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6808 |
| Missing (%) | 2.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 9.353480075 |
| Min length | 7 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arthropoda |
|---|---|
| 2nd row | Annelida |
| 3rd row | Annelida |
| 4th row | Arthropoda |
| 5th row | Tracheophyta |
| Value | Count | Frequency (%) |
| arthropoda | 145971 | |
| chordata | 103372 | |
| tracheophyta | 30584 | 9.2% |
| mollusca | 20737 | 6.3% |
| annelida | 11327 | 3.4% |
| cnidaria | 3177 | 1.0% |
| rhodophyta | 2942 | 0.9% |
| myzozoa | 2110 | 0.6% |
| echinodermata | 1630 | 0.5% |
| chlorophyta | 1622 | 0.5% |
| Other values (34) | 7814 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 473882 | |
| o | 469081 | |
| r | 439672 | |
| h | 325528 | |
| t | 292348 | |
| d | 269484 | |
| p | 182549 | 5.9% |
| A | 157354 | 5.1% |
| C | 109510 | 3.5% |
| l | 56397 | 1.8% |
| Other values (35) | 322872 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2767367 | |
| Uppercase Letter | 331282 | 10.7% |
| Decimal Number | 28 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 473882 | |
| o | 469081 | |
| r | 439672 | |
| h | 325528 | |
| t | 292348 | |
| d | 269484 | |
| p | 182549 | 6.6% |
| l | 56397 | 2.0% |
| c | 56157 | 2.0% |
| e | 51319 | 1.9% |
| Other values (10) | 150950 | 5.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 157354 | |
| C | 109510 | |
| T | 30585 | 9.2% |
| M | 22851 | 6.9% |
| R | 2943 | 0.9% |
| P | 2127 | 0.6% |
| E | 1668 | 0.5% |
| N | 1361 | 0.4% |
| B | 1257 | 0.4% |
| O | 847 | 0.3% |
| Other values (6) | 779 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7 | |
| 7 | 4 | |
| 4 | 3 | |
| 6 | 3 | |
| 3 | 3 | |
| 0 | 2 | 7.1% |
| 9 | 2 | 7.1% |
| 8 | 2 | 7.1% |
| 1 | 2 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3098649 | |
| Common | 28 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 473882 | |
| o | 469081 | |
| r | 439672 | |
| h | 325528 | |
| t | 292348 | |
| d | 269484 | |
| p | 182549 | 5.9% |
| A | 157354 | 5.1% |
| C | 109510 | 3.5% |
| l | 56397 | 1.8% |
| Other values (26) | 322844 |
Common
| Value | Count | Frequency (%) |
| 2 | 7 | |
| 7 | 4 | |
| 4 | 3 | |
| 6 | 3 | |
| 3 | 3 | |
| 0 | 2 | 7.1% |
| 9 | 2 | 7.1% |
| 8 | 2 | 7.1% |
| 1 | 2 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3098677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 473882 | |
| o | 469081 | |
| r | 439672 | |
| h | 325528 | |
| t | 292348 | |
| d | 269484 | |
| p | 182549 | 5.9% |
| A | 157354 | 5.1% |
| C | 109510 | 3.5% |
| l | 56397 | 1.8% |
| Other values (35) | 322872 |
class
Text
Missing 
| Distinct | 105 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52277 |
| Missing (%) | 15.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 8.708561772 |
| Min length | 4 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Insecta |
|---|---|
| 2nd row | Polychaeta |
| 3rd row | Polychaeta |
| 4th row | Malacostraca |
| 5th row | Polypodiopsida |
| Value | Count | Frequency (%) |
| insecta | 112951 | |
| malacostraca | 27895 | 9.8% |
| mammalia | 24478 | 8.6% |
| amphibia | 18384 | 6.4% |
| magnoliopsida | 15795 | 5.5% |
| liliopsida | 10876 | 3.8% |
| polychaeta | 10686 | 3.7% |
| bivalvia | 9771 | 3.4% |
| gastropoda | 9525 | 3.3% |
| squamata | 9481 | 3.3% |
| Other values (95) | 35975 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 486650 | |
| s | 193470 | 7.8% |
| c | 193177 | 7.8% |
| t | 177830 | 7.1% |
| i | 170083 | 6.8% |
| e | 160201 | 6.4% |
| o | 142892 | 5.7% |
| n | 138528 | 5.6% |
| l | 116613 | 4.7% |
| I | 112951 | 4.5% |
| Other values (34) | 596660 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2203238 | |
| Uppercase Letter | 285817 | 11.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 486650 | |
| s | 193470 | 8.8% |
| c | 193177 | 8.8% |
| t | 177830 | 8.1% |
| i | 170083 | 7.7% |
| e | 160201 | 7.3% |
| o | 142892 | 6.5% |
| n | 138528 | 6.3% |
| l | 116613 | 5.3% |
| m | 81034 | 3.7% |
| Other values (14) | 342760 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 112951 | |
| M | 69115 | |
| A | 32386 | 11.3% |
| P | 15849 | 5.5% |
| L | 11211 | 3.9% |
| G | 10243 | 3.6% |
| S | 9874 | 3.5% |
| B | 9817 | 3.4% |
| C | 3459 | 1.2% |
| F | 2942 | 1.0% |
| Other values (10) | 7970 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2489055 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 486650 | |
| s | 193470 | 7.8% |
| c | 193177 | 7.8% |
| t | 177830 | 7.1% |
| i | 170083 | 6.8% |
| e | 160201 | 6.4% |
| o | 142892 | 5.7% |
| n | 138528 | 5.6% |
| l | 116613 | 4.7% |
| I | 112951 | 4.5% |
| Other values (34) | 596660 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2489055 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 486650 | |
| s | 193470 | 7.8% |
| c | 193177 | 7.8% |
| t | 177830 | 7.1% |
| i | 170083 | 6.8% |
| e | 160201 | 6.4% |
| o | 142892 | 5.7% |
| n | 138528 | 5.6% |
| l | 116613 | 4.7% |
| I | 112951 | 4.5% |
| Other values (34) | 596660 |
order
Text
Missing 
| Distinct | 534 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 30344 |
| Missing (%) | 9.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 19 |
| Mean length | 9.980555646 |
| Min length | 5 |
Unique
| Unique | 51 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lepidoptera |
|---|---|
| 2nd row | Sabellida |
| 3rd row | Amphinomida |
| 4th row | Decapoda |
| 5th row | Polypodiales |
| Value | Count | Frequency (%) |
| lepidoptera | 79519 | |
| perciformes | 25783 | 8.4% |
| decapoda | 23755 | 7.7% |
| coleoptera | 10132 | 3.3% |
| anura | 10009 | 3.3% |
| hymenoptera | 8496 | 2.8% |
| rodentia | 8406 | 2.7% |
| caudata | 8204 | 2.7% |
| poales | 7858 | 2.6% |
| cetacea | 7808 | 2.5% |
| Other values (524) | 117780 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 434314 | |
| a | 364271 | |
| o | 268279 | 8.7% |
| r | 262893 | 8.6% |
| p | 253174 | 8.2% |
| i | 238211 | 7.8% |
| t | 176177 | 5.7% |
| d | 167228 | 5.4% |
| s | 113822 | 3.7% |
| l | 95900 | 3.1% |
| Other values (49) | 697247 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2763748 | |
| Uppercase Letter | 307747 | 10.0% |
| Decimal Number | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 434314 | |
| a | 364271 | |
| o | 268279 | |
| r | 262893 | |
| p | 253174 | |
| i | 238211 | |
| t | 176177 | |
| d | 167228 | 6.1% |
| s | 113822 | 4.1% |
| l | 95900 | 3.5% |
| Other values (16) | 389479 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 85088 | |
| P | 48522 | |
| C | 41463 | |
| D | 32883 | 10.7% |
| A | 26001 | 8.4% |
| H | 14781 | 4.8% |
| S | 14644 | 4.8% |
| R | 10170 | 3.3% |
| M | 7556 | 2.5% |
| V | 5610 | 1.8% |
| Other values (14) | 21029 | 6.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 6 | 3 | |
| 3 | 3 | |
| 4 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 2 | |
| 1 | 2 | |
| 8 | 1 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3071495 | |
| Common | 21 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 434314 | |
| a | 364271 | |
| o | 268279 | 8.7% |
| r | 262893 | 8.6% |
| p | 253174 | 8.2% |
| i | 238211 | 7.8% |
| t | 176177 | 5.7% |
| d | 167228 | 5.4% |
| s | 113822 | 3.7% |
| l | 95900 | 3.1% |
| Other values (40) | 697226 |
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 6 | 3 | |
| 3 | 3 | |
| 4 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 2 | |
| 1 | 2 | |
| 8 | 1 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3071516 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 434314 | |
| a | 364271 | |
| o | 268279 | 8.7% |
| r | 262893 | 8.6% |
| p | 253174 | 8.2% |
| i | 238211 | 7.8% |
| t | 176177 | 5.7% |
| d | 167228 | 5.4% |
| s | 113822 | 3.7% |
| l | 95900 | 3.1% |
| Other values (49) | 697247 |
superfamily
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338091 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 21 |
| Mean length | 21 |
| Min length | 19 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodon nudivittis |
|---|---|
| 2nd row | Coccocypselum guianense |
| 3rd row | Emoia caeruleocauda |
| Value | Count | Frequency (%) |
| champsodon | 1 | |
| nudivittis | 1 | |
| coccocypselum | 1 | |
| guianense | 1 | |
| emoia | 1 | |
| caeruleocauda | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | 9.5% |
| o | 6 | 9.5% |
| u | 5 | 7.9% |
| e | 5 | 7.9% |
| i | 5 | 7.9% |
| c | 5 | 7.9% |
| s | 4 | 6.3% |
| n | 4 | 6.3% |
| m | 3 | 4.8% |
| d | 3 | 4.8% |
| Other values (11) | 17 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57 | |
| Space Separator | 3 | 4.8% |
| Uppercase Letter | 3 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| o | 6 | |
| u | 5 | |
| e | 5 | |
| i | 5 | |
| c | 5 | |
| s | 4 | 7.0% |
| n | 4 | 7.0% |
| m | 3 | 5.3% |
| d | 3 | 5.3% |
| Other values (8) | 11 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 60 | |
| Common | 3 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| o | 6 | |
| u | 5 | 8.3% |
| e | 5 | 8.3% |
| i | 5 | 8.3% |
| c | 5 | 8.3% |
| s | 4 | 6.7% |
| n | 4 | 6.7% |
| m | 3 | 5.0% |
| d | 3 | 5.0% |
| Other values (10) | 14 |
Common
| Value | Count | Frequency (%) |
| (space) | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | 9.5% |
| o | 6 | 9.5% |
| u | 5 | 7.9% |
| e | 5 | 7.9% |
| i | 5 | 7.9% |
| c | 5 | 7.9% |
| s | 4 | 6.3% |
| n | 4 | 6.3% |
| m | 3 | 4.8% |
| d | 3 | 4.8% |
| Other values (11) | 17 |
family
Text
Missing 
| Distinct | 3097 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 19906 |
| Missing (%) | 5.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 20 |
| Mean length | 10.83802029 |
| Min length | 6 |
Unique
| Unique | 312 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Depressariidae |
|---|---|
| 2nd row | Siboglinidae |
| 3rd row | Amphinomidae |
| 4th row | Cambaridae |
| 5th row | Dryopteridaceae |
| Value | Count | Frequency (%) |
| cambaridae | 12102 | 3.8% |
| geometridae | 12012 | 3.8% |
| noctuidae | 7500 | 2.4% |
| tortricidae | 7246 | 2.3% |
| plethodontidae | 6784 | 2.1% |
| poaceae | 6677 | 2.1% |
| delphinidae | 5540 | 1.7% |
| erebidae | 5452 | 1.7% |
| siboglinidae | 5009 | 1.6% |
| vesicomyidae | 4930 | 1.5% |
| Other values (3098) | 244947 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 525485 | |
| a | 509708 | |
| i | 440350 | |
| d | 317748 | |
| r | 191641 | 5.6% |
| o | 190413 | 5.5% |
| c | 139936 | 4.1% |
| t | 125374 | 3.6% |
| l | 122376 | 3.5% |
| n | 101480 | 2.9% |
| Other values (52) | 784017 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3130303 | |
| Uppercase Letter | 318195 | 9.2% |
| Space Separator | 11 | < 0.1% |
| Decimal Number | 8 | < 0.1% |
| Other Punctuation | 5 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 525485 | |
| a | 509708 | |
| i | 440350 | |
| d | 317748 | |
| r | 191641 | 6.1% |
| o | 190413 | 6.1% |
| c | 139936 | 4.5% |
| t | 125374 | 4.0% |
| l | 122376 | 3.9% |
| n | 101480 | 3.2% |
| Other values (16) | 465792 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 53292 | |
| P | 43342 | |
| G | 29396 | |
| S | 26858 | |
| A | 22868 | 7.2% |
| T | 18614 | 5.8% |
| M | 18061 | 5.7% |
| D | 16080 | 5.1% |
| L | 13747 | 4.3% |
| N | 12781 | 4.0% |
| Other values (16) | 63156 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 8 | 2 | |
| 9 | 2 | |
| 2 | 1 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3448498 | |
| Common | 30 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 525485 | |
| a | 509708 | |
| i | 440350 | |
| d | 317748 | |
| r | 191641 | 5.6% |
| o | 190413 | 5.5% |
| c | 139936 | 4.1% |
| t | 125374 | 3.6% |
| l | 122376 | 3.5% |
| n | 101480 | 2.9% |
| Other values (42) | 783987 |
Common
| Value | Count | Frequency (%) |
| (space) | 11 | |
| ( | 3 | 10.0% |
| . | 3 | 10.0% |
| ) | 3 | 10.0% |
| , | 2 | 6.7% |
| 1 | 2 | 6.7% |
| 8 | 2 | 6.7% |
| 9 | 2 | 6.7% |
| 2 | 1 | 3.3% |
| 5 | 1 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3448528 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 525485 | |
| a | 509708 | |
| i | 440350 | |
| d | 317748 | |
| r | 191641 | 5.6% |
| o | 190413 | 5.5% |
| c | 139936 | 4.1% |
| t | 125374 | 3.6% |
| l | 122376 | 3.5% |
| n | 101480 | 2.9% |
| Other values (52) | 784017 |
subfamily
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 19.75 |
| Min length | 16 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodon nudivittis |
|---|---|
| 2nd row | Coccocypselum guianense |
| 3rd row | Emoia caeruleocauda |
| 4th row | Dimorphandra sp. |
| Value | Count | Frequency (%) |
| champsodon | 1 | |
| nudivittis | 1 | |
| coccocypselum | 1 | |
| guianense | 1 | |
| emoia | 1 | |
| caeruleocauda | 1 | |
| dimorphandra | 1 | |
| sp | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | 10.1% |
| o | 7 | 8.9% |
| i | 6 | 7.6% |
| c | 5 | 6.3% |
| s | 5 | 6.3% |
| n | 5 | 6.3% |
| u | 5 | 6.3% |
| e | 5 | 6.3% |
| m | 4 | 5.1% |
| p | 4 | 5.1% |
| Other values (13) | 25 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70 | |
| Space Separator | 4 | 5.1% |
| Uppercase Letter | 4 | 5.1% |
| Other Punctuation | 1 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| o | 7 | |
| i | 6 | 8.6% |
| c | 5 | 7.1% |
| s | 5 | 7.1% |
| n | 5 | 7.1% |
| u | 5 | 7.1% |
| e | 5 | 7.1% |
| m | 4 | 5.7% |
| p | 4 | 5.7% |
| Other values (8) | 16 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 1 | |
| D | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 74 | |
| Common | 5 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | 10.8% |
| o | 7 | 9.5% |
| i | 6 | 8.1% |
| c | 5 | 6.8% |
| s | 5 | 6.8% |
| n | 5 | 6.8% |
| u | 5 | 6.8% |
| e | 5 | 6.8% |
| m | 4 | 5.4% |
| p | 4 | 5.4% |
| Other values (11) | 20 |
Common
| Value | Count | Frequency (%) |
| (space) | 4 | |
| . | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | 10.1% |
| o | 7 | 8.9% |
| i | 6 | 7.6% |
| c | 5 | 6.3% |
| s | 5 | 6.3% |
| n | 5 | 6.3% |
| u | 5 | 6.3% |
| e | 5 | 6.3% |
| m | 4 | 5.1% |
| p | 4 | 5.1% |
| Other values (13) | 25 |
subtribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| Value | Count | Frequency (%) |
| eml | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 4 | |
| M | 4 | |
| L | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4 | |
| M | 4 | |
| L | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 4 | |
| M | 4 | |
| L | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 4 | |
| M | 4 | |
| L | 4 |
genus
Text
Missing 
| Distinct | 19311 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 34392 |
| Missing (%) | 10.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 9.330027461 |
| Min length | 3 |
Unique
| Unique | 2143 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Rectiostoma |
|---|---|
| 2nd row | Polystichum |
| 3rd row | Mesontoplatys |
| 4th row | Dulcerana |
| 5th row | Amanses |
| Value | Count | Frequency (%) |
| plethodon | 4671 | 1.5% |
| faxonius | 4236 | 1.4% |
| procambarus | 3675 | 1.2% |
| bathymodiolus | 2587 | 0.9% |
| riftia | 2006 | 0.7% |
| tursiops | 1919 | 0.6% |
| cambarus | 1707 | 0.6% |
| delphinus | 1662 | 0.5% |
| aegla | 1424 | 0.5% |
| anolis | 1420 | 0.5% |
| Other values (19301) | 278395 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 312625 | 11.0% |
| o | 247355 | 8.7% |
| i | 219282 | 7.7% |
| s | 204987 | 7.2% |
| e | 199721 | 7.0% |
| r | 183312 | 6.5% |
| t | 142209 | 5.0% |
| l | 138064 | 4.9% |
| n | 131110 | 4.6% |
| u | 127288 | 4.5% |
| Other values (55) | 927595 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2529740 | |
| Uppercase Letter | 303712 | 10.7% |
| Decimal Number | 68 | < 0.1% |
| Dash Punctuation | 16 | < 0.1% |
| Other Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 312625 | |
| o | 247355 | 9.8% |
| i | 219282 | 8.7% |
| s | 204987 | 8.1% |
| e | 199721 | 7.9% |
| r | 183312 | 7.2% |
| t | 142209 | 5.6% |
| l | 138064 | 5.5% |
| n | 131110 | 5.2% |
| u | 127288 | 5.0% |
| Other values (16) | 623787 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 46871 | |
| C | 36089 | |
| A | 32417 | |
| S | 23721 | 7.8% |
| M | 19100 | 6.3% |
| E | 17748 | 5.8% |
| L | 15962 | 5.3% |
| H | 15384 | 5.1% |
| T | 14009 | 4.6% |
| D | 13393 | 4.4% |
| Other values (16) | 69018 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 18 | |
| 1 | 16 | |
| 0 | 13 | |
| 7 | 6 | 8.8% |
| 4 | 4 | 5.9% |
| 3 | 3 | 4.4% |
| 8 | 3 | 4.4% |
| 5 | 2 | 2.9% |
| 6 | 2 | 2.9% |
| 9 | 1 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 8 | |
| . | 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2833452 | |
| Common | 96 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 312625 | 11.0% |
| o | 247355 | 8.7% |
| i | 219282 | 7.7% |
| s | 204987 | 7.2% |
| e | 199721 | 7.0% |
| r | 183312 | 6.5% |
| t | 142209 | 5.0% |
| l | 138064 | 4.9% |
| n | 131110 | 4.6% |
| u | 127288 | 4.5% |
| Other values (42) | 927499 |
Common
| Value | Count | Frequency (%) |
| 2 | 18 | |
| - | 16 | |
| 1 | 16 | |
| 0 | 13 | |
| : | 8 | |
| 7 | 6 | 6.2% |
| 4 | 4 | 4.2% |
| . | 4 | 4.2% |
| 3 | 3 | 3.1% |
| 8 | 3 | 3.1% |
| Other values (3) | 5 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2833548 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 312625 | 11.0% |
| o | 247355 | 8.7% |
| i | 219282 | 7.7% |
| s | 204987 | 7.2% |
| e | 199721 | 7.0% |
| r | 183312 | 6.5% |
| t | 142209 | 5.0% |
| l | 138064 | 4.9% |
| n | 131110 | 4.6% |
| u | 127288 | 4.5% |
| Other values (55) | 927595 |
genericName
Text
Missing 
| Distinct | 19280 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 34393 |
| Missing (%) | 10.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 9.355540482 |
| Min length | 3 |
Unique
| Unique | 2065 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Rectiostoma |
|---|---|
| 2nd row | Polystichum |
| 3rd row | Mesontoplatys |
| 4th row | Bursa |
| 5th row | Amanses |
| Value | Count | Frequency (%) |
| plethodon | 4671 | 1.5% |
| orconectes | 4548 | 1.5% |
| procambarus | 3716 | 1.2% |
| bathymodiolus | 2598 | 0.9% |
| riftia | 2006 | 0.7% |
| tursiops | 1919 | 0.6% |
| cambarus | 1853 | 0.6% |
| delphinus | 1662 | 0.5% |
| aegla | 1424 | 0.5% |
| anolis | 1388 | 0.5% |
| Other values (19270) | 277916 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 309944 | 10.9% |
| o | 245779 | 8.7% |
| i | 214451 | 7.5% |
| e | 208150 | 7.3% |
| s | 204498 | 7.2% |
| r | 186229 | 6.6% |
| t | 147357 | 5.2% |
| l | 138695 | 4.9% |
| n | 130165 | 4.6% |
| u | 122278 | 4.3% |
| Other values (50) | 933741 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2537482 | |
| Uppercase Letter | 303711 | 10.7% |
| Decimal Number | 68 | < 0.1% |
| Dash Punctuation | 14 | < 0.1% |
| Other Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 309944 | |
| o | 245779 | 9.7% |
| i | 214451 | 8.5% |
| e | 208150 | 8.2% |
| s | 204498 | 8.1% |
| r | 186229 | 7.3% |
| t | 147357 | 5.8% |
| l | 138695 | 5.5% |
| n | 130165 | 5.1% |
| u | 122278 | 4.8% |
| Other values (16) | 629936 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 47262 | |
| C | 36122 | |
| A | 32888 | |
| S | 23249 | 7.7% |
| M | 18480 | 6.1% |
| E | 17680 | 5.8% |
| L | 16185 | 5.3% |
| H | 15736 | 5.2% |
| T | 13906 | 4.6% |
| D | 13383 | 4.4% |
| Other values (16) | 68820 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 28 | |
| 2 | 16 | |
| 0 | 12 | |
| 7 | 8 | 11.8% |
| 4 | 4 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 8 | |
| . | 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2841193 | |
| Common | 94 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 309944 | 10.9% |
| o | 245779 | 8.7% |
| i | 214451 | 7.5% |
| e | 208150 | 7.3% |
| s | 204498 | 7.2% |
| r | 186229 | 6.6% |
| t | 147357 | 5.2% |
| l | 138695 | 4.9% |
| n | 130165 | 4.6% |
| u | 122278 | 4.3% |
| Other values (42) | 933647 |
Common
| Value | Count | Frequency (%) |
| 1 | 28 | |
| 2 | 16 | |
| - | 14 | |
| 0 | 12 | |
| : | 8 | 8.5% |
| 7 | 8 | 8.5% |
| 4 | 4 | 4.3% |
| . | 4 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2841287 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 309944 | 10.9% |
| o | 245779 | 8.7% |
| i | 214451 | 7.5% |
| e | 208150 | 7.3% |
| s | 204498 | 7.2% |
| r | 186229 | 6.6% |
| t | 147357 | 5.2% |
| l | 138695 | 4.9% |
| n | 130165 | 4.6% |
| u | 122278 | 4.3% |
| Other values (50) | 933741 |
subgenus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | true |
| Value | Count | Frequency (%) |
| true | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4 | |
| r | 4 | |
| u | 4 | |
| e | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4 | |
| r | 4 | |
| u | 4 | |
| e | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4 | |
| r | 4 | |
| u | 4 | |
| e | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4 | |
| r | 4 | |
| u | 4 | |
| e | 4 |
specificEpithet
Text
Missing 
| Distinct | 22410 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 89523 |
| Missing (%) | 26.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 8.891676825 |
| Min length | 2 |
Unique
| Unique | 3280 |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | fernaldella |
|---|---|
| 2nd row | bolzi |
| 3rd row | granularis |
| 4th row | scopas |
| 5th row | extenta |
| Value | Count | Frequency (%) |
| truncatus | 1929 | 0.8% |
| cinereus | 1842 | 0.7% |
| delphis | 1660 | 0.7% |
| porphyriticus | 815 | 0.3% |
| acutus | 778 | 0.3% |
| opacum | 765 | 0.3% |
| hoffmani | 639 | 0.3% |
| maculatus | 632 | 0.3% |
| nigripes | 624 | 0.3% |
| carolinensis | 597 | 0.2% |
| Other values (22400) | 238290 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 289852 | |
| i | 243869 | |
| s | 189586 | 8.6% |
| e | 172279 | 7.8% |
| r | 150335 | 6.8% |
| l | 148609 | 6.7% |
| u | 138502 | 6.3% |
| n | 137838 | 6.2% |
| t | 126139 | 5.7% |
| o | 112282 | 5.1% |
| Other values (18) | 500922 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2210057 | |
| Dash Punctuation | 156 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 289852 | |
| i | 243869 | |
| s | 189586 | 8.6% |
| e | 172279 | 7.8% |
| r | 150335 | 6.8% |
| l | 148609 | 6.7% |
| u | 138502 | 6.3% |
| n | 137838 | 6.2% |
| t | 126139 | 5.7% |
| o | 112282 | 5.1% |
| Other values (17) | 500766 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 156 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2210057 | |
| Common | 156 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 289852 | |
| i | 243869 | |
| s | 189586 | 8.6% |
| e | 172279 | 7.8% |
| r | 150335 | 6.8% |
| l | 148609 | 6.7% |
| u | 138502 | 6.3% |
| n | 137838 | 6.2% |
| t | 126139 | 5.7% |
| o | 112282 | 5.1% |
| Other values (17) | 500766 |
Common
| Value | Count | Frequency (%) |
| - | 156 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2210212 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 289852 | |
| i | 243869 | |
| s | 189586 | 8.6% |
| e | 172279 | 7.8% |
| r | 150335 | 6.8% |
| l | 148609 | 6.7% |
| u | 138502 | 6.3% |
| n | 137838 | 6.2% |
| t | 126139 | 5.7% |
| o | 112282 | 5.1% |
| Other values (17) | 500921 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
Missing 
| Distinct | 1598 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 328999 |
| Missing (%) | 97.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 9.015063222 |
| Min length | 3 |
Unique
| Unique | 647 |
|---|---|
| Unique (%) | 7.1% |
Sample
| 1st row | cinereus |
|---|---|
| 2nd row | benjamina |
| 3rd row | mexicana |
| 4th row | doliatus |
| 5th row | pallidirostris |
| Value | Count | Frequency (%) |
| pennsylvanicus | 615 | 6.8% |
| cinereus | 494 | 5.4% |
| talpoides | 246 | 2.7% |
| melas | 245 | 2.7% |
| dickeyi | 167 | 1.8% |
| meeki | 106 | 1.2% |
| porteri | 91 | 1.0% |
| fumeus | 88 | 1.0% |
| parva | 74 | 0.8% |
| couguar | 61 | 0.7% |
| Other values (1588) | 6908 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 9129 | |
| a | 8963 | |
| s | 8483 | |
| e | 7210 | 8.8% |
| n | 6911 | 8.4% |
| u | 5147 | 6.3% |
| r | 5031 | 6.1% |
| l | 4672 | 5.7% |
| c | 4450 | 5.4% |
| o | 3740 | 4.6% |
| Other values (17) | 18256 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 81989 | |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 9129 | |
| a | 8963 | |
| s | 8483 | |
| e | 7210 | 8.8% |
| n | 6911 | 8.4% |
| u | 5147 | 6.3% |
| r | 5031 | 6.1% |
| l | 4672 | 5.7% |
| c | 4450 | 5.4% |
| o | 3740 | 4.6% |
| Other values (16) | 18253 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 81989 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 9129 | |
| a | 8963 | |
| s | 8483 | |
| e | 7210 | 8.8% |
| n | 6911 | 8.4% |
| u | 5147 | 6.3% |
| r | 5031 | 6.1% |
| l | 4672 | 5.7% |
| c | 4450 | 5.4% |
| o | 3740 | 4.6% |
| Other values (16) | 18253 |
Common
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 9129 | |
| a | 8963 | |
| s | 8483 | |
| e | 7210 | 8.8% |
| n | 6911 | 8.4% |
| u | 5147 | 6.3% |
| r | 5031 | 6.1% |
| l | 4672 | 5.7% |
| c | 4450 | 5.4% |
| o | 3740 | 4.6% |
| Other values (17) | 18256 |
cultivarEpithet
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.25 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | ASIA |
|---|---|
| 2nd row | LATIN_AMERICA |
| 3rd row | OCEANIA |
| 4th row | LATIN_AMERICA |
| Value | Count | Frequency (%) |
| latin_america | 2 | |
| asia | 1 | |
| oceania | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10 | |
| I | 6 | |
| N | 3 | 8.1% |
| E | 3 | 8.1% |
| C | 3 | 8.1% |
| L | 2 | 5.4% |
| T | 2 | 5.4% |
| _ | 2 | 5.4% |
| M | 2 | 5.4% |
| R | 2 | 5.4% |
| Other values (2) | 2 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 35 | |
| Connector Punctuation | 2 | 5.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10 | |
| I | 6 | |
| N | 3 | 8.6% |
| E | 3 | 8.6% |
| C | 3 | 8.6% |
| L | 2 | 5.7% |
| T | 2 | 5.7% |
| M | 2 | 5.7% |
| R | 2 | 5.7% |
| S | 1 | 2.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 35 | |
| Common | 2 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10 | |
| I | 6 | |
| N | 3 | 8.6% |
| E | 3 | 8.6% |
| C | 3 | 8.6% |
| L | 2 | 5.7% |
| T | 2 | 5.7% |
| M | 2 | 5.7% |
| R | 2 | 5.7% |
| S | 1 | 2.9% |
Common
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 10 | |
| I | 6 | |
| N | 3 | 8.1% |
| E | 3 | 8.1% |
| C | 3 | 8.1% |
| L | 2 | 5.4% |
| T | 2 | 5.4% |
| _ | 2 | 5.4% |
| M | 2 | 5.4% |
| R | 2 | 5.4% |
| Other values (2) | 2 | 5.4% |
taxonRank
Text
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.638623101 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | FAMILY |
| 3rd row | FAMILY |
| 4th row | FAMILY |
| 5th row | GENUS |
| Value | Count | Frequency (%) |
| species | 239484 | |
| genus | 55123 | 16.3% |
| family | 14870 | 4.4% |
| subspecies | 8236 | 2.4% |
| kingdom | 6690 | 2.0% |
| order | 4994 | 1.5% |
| phylum | 4014 | 1.2% |
| class | 3823 | 1.1% |
| variety | 806 | 0.2% |
| form | 49 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 566445 | |
| E | 556367 | |
| I | 270090 | |
| P | 251734 | |
| C | 251547 | |
| U | 67373 | 3.0% |
| N | 61817 | 2.8% |
| G | 61813 | 2.8% |
| M | 25627 | 1.1% |
| L | 22707 | 1.0% |
| Other values (12) | 108952 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2244468 | |
| Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 566445 | |
| E | 556367 | |
| I | 270090 | |
| P | 251734 | |
| C | 251547 | |
| U | 67373 | 3.0% |
| N | 61817 | 2.8% |
| G | 61813 | 2.8% |
| M | 25627 | 1.1% |
| L | 22707 | 1.0% |
| Other values (11) | 108948 | 4.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2244468 | |
| Common | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 566445 | |
| E | 556367 | |
| I | 270090 | |
| P | 251734 | |
| C | 251547 | |
| U | 67373 | 3.0% |
| N | 61817 | 2.8% |
| G | 61813 | 2.8% |
| M | 25627 | 1.1% |
| L | 22707 | 1.0% |
| Other values (11) | 108948 | 4.9% |
Common
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2244472 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 566445 | |
| E | 556367 | |
| I | 270090 | |
| P | 251734 | |
| C | 251547 | |
| U | 67373 | 3.0% |
| N | 61817 | 2.8% |
| G | 61813 | 2.8% |
| M | 25627 | 1.1% |
| L | 22707 | 1.0% |
| Other values (12) | 108952 | 4.9% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | PHL |
|---|---|
| 2nd row | GUY |
| 3rd row | PLW |
| 4th row | GUY |
| Value | Count | Frequency (%) |
| guy | 2 | |
| phl | 1 | |
| plw | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 2 | |
| L | 2 | |
| H | 1 | |
| W | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 2 | |
| L | 2 | |
| H | 1 | |
| W | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 2 | |
| L | 2 | |
| H | 1 | |
| W | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 2 | |
| L | 2 | |
| H | 1 | |
| W | 1 |
vernacularName
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8.5 |
| Mean length | 7 |
| Min length | 5 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | Philippines |
|---|---|
| 2nd row | Guyana |
| 3rd row | Palau |
| 4th row | Guyana |
| Value | Count | Frequency (%) |
| guyana | 2 | |
| philippines | 1 | |
| palau | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| u | 3 | |
| n | 3 | |
| i | 3 | |
| G | 2 | 7.1% |
| y | 2 | 7.1% |
| P | 2 | 7.1% |
| l | 2 | 7.1% |
| p | 2 | 7.1% |
| h | 1 | 3.6% |
| Other values (2) | 2 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 4 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| u | 3 | |
| n | 3 | |
| i | 3 | |
| y | 2 | 8.3% |
| l | 2 | 8.3% |
| p | 2 | 8.3% |
| h | 1 | 4.2% |
| e | 1 | 4.2% |
| s | 1 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2 | |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| u | 3 | |
| n | 3 | |
| i | 3 | |
| G | 2 | 7.1% |
| y | 2 | 7.1% |
| P | 2 | 7.1% |
| l | 2 | 7.1% |
| p | 2 | 7.1% |
| h | 1 | 3.6% |
| Other values (2) | 2 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| u | 3 | |
| n | 3 | |
| i | 3 | |
| G | 2 | 7.1% |
| y | 2 | 7.1% |
| P | 2 | 7.1% |
| l | 2 | 7.1% |
| p | 2 | 7.1% |
| h | 1 | 3.6% |
| Other values (2) | 2 | 7.1% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 338090 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.25 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | PHL.36_1 |
|---|---|
| 2nd row | GUY.2_1 |
| 3rd row | PLW.6_1 |
| 4th row | GUY.2_1 |
| Value | Count | Frequency (%) |
| guy.2_1 | 2 | |
| phl.36_1 | 1 | |
| plw.6_1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4 | |
| _ | 4 | |
| 1 | 4 | |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| 2 | 2 | |
| P | 2 | |
| L | 2 | |
| 6 | 2 | |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12 | |
| Decimal Number | 9 | |
| Other Punctuation | 4 | 13.8% |
| Connector Punctuation | 4 | 13.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 2 | |
| L | 2 | |
| H | 1 | |
| W | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 2 | 2 | |
| 6 | 2 | |
| 3 | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17 | |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 2 | |
| L | 2 | |
| H | 1 | |
| W | 1 |
Common
| Value | Count | Frequency (%) |
| . | 4 | |
| _ | 4 | |
| 1 | 4 | |
| 2 | 2 | |
| 6 | 2 | |
| 3 | 1 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4 | |
| _ | 4 | |
| 1 | 4 | |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| 2 | 2 | |
| P | 2 | |
| L | 2 | |
| 6 | 2 | |
| Other values (3) | 3 |
taxonomicStatus
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6108 |
| Missing (%) | 1.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 7.926999934 |
| Min length | 5 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 305150 | |
| synonym | 24244 | 7.3% |
| doubtful | 2588 | 0.8% |
| cuyuni-mazaruni | 2 | < 0.1% |
| iloilo | 1 | < 0.1% |
| koror | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 610302 | |
| E | 610300 | |
| T | 307738 | |
| D | 307738 | |
| A | 305150 | |
| P | 305150 | |
| Y | 48488 | 1.8% |
| N | 48488 | 1.8% |
| O | 26832 | 1.0% |
| M | 24246 | 0.9% |
| Other values (17) | 37221 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2631618 | |
| Lowercase Letter | 33 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 610302 | |
| E | 610300 | |
| T | 307738 | |
| D | 307738 | |
| A | 305150 | |
| P | 305150 | |
| Y | 48488 | 1.8% |
| N | 48488 | 1.8% |
| O | 26832 | 1.0% |
| M | 24246 | 0.9% |
| Other values (7) | 37186 | 1.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 6 | |
| i | 5 | |
| n | 4 | |
| a | 4 | |
| r | 4 | |
| o | 4 | |
| y | 2 | 6.1% |
| z | 2 | 6.1% |
| l | 2 | 6.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2631651 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 610302 | |
| E | 610300 | |
| T | 307738 | |
| D | 307738 | |
| A | 305150 | |
| P | 305150 | |
| Y | 48488 | 1.8% |
| N | 48488 | 1.8% |
| O | 26832 | 1.0% |
| M | 24246 | 0.9% |
| Other values (16) | 37219 | 1.4% |
Common
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2631653 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 610302 | |
| E | 610300 | |
| T | 307738 | |
| D | 307738 | |
| A | 305150 | |
| P | 305150 | |
| Y | 48488 | 1.8% |
| N | 48488 | 1.8% |
| O | 26832 | 1.0% |
| M | 24246 | 0.9% |
| Other values (17) | 37221 | 1.4% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 338091 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9.666666667 |
| Min length | 9 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | PHL.36.21_1 |
|---|---|
| 2nd row | GUY.2.8_1 |
| 3rd row | GUY.2.8_1 |
| Value | Count | Frequency (%) |
| guy.2.8_1 | 2 | |
| phl.36.21_1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6 | |
| 1 | 4 | |
| 2 | 3 | |
| _ | 3 | |
| G | 2 | 6.9% |
| U | 2 | 6.9% |
| Y | 2 | 6.9% |
| 8 | 2 | 6.9% |
| P | 1 | 3.4% |
| H | 1 | 3.4% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11 | |
| Uppercase Letter | 9 | |
| Other Punctuation | 6 | |
| Connector Punctuation | 3 | 10.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 2 | 3 | |
| 8 | 2 | |
| 3 | 1 | 9.1% |
| 6 | 1 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20 | |
| Latin | 9 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 6 | |
| 1 | 4 | |
| 2 | 3 | |
| _ | 3 | |
| 8 | 2 | 10.0% |
| 3 | 1 | 5.0% |
| 6 | 1 | 5.0% |
Latin
| Value | Count | Frequency (%) |
| G | 2 | |
| U | 2 | |
| Y | 2 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 6 | |
| 1 | 4 | |
| 2 | 3 | |
| _ | 3 | |
| G | 2 | 6.9% |
| U | 2 | 6.9% |
| Y | 2 | 6.9% |
| 8 | 2 | 6.9% |
| P | 1 | 3.4% |
| H | 1 | 3.4% |
| Other values (3) | 3 |
taxonRemarks
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 338091 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 14.33333333 |
| Min length | 11 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | Iloilo City |
|---|---|
| 2nd row | Rest of Region 7 |
| 3rd row | Rest of Region 7 |
| Value | Count | Frequency (%) |
| rest | 2 | |
| of | 2 | |
| region | 2 | |
| 7 | 2 | |
| iloilo | 1 | |
| city | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7 | |
| o | 6 | |
| R | 4 | |
| e | 4 | |
| i | 4 | |
| t | 3 | |
| s | 2 | 4.7% |
| f | 2 | 4.7% |
| g | 2 | 4.7% |
| n | 2 | 4.7% |
| Other values (5) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28 | |
| Space Separator | 7 | 16.3% |
| Uppercase Letter | 6 | 14.0% |
| Decimal Number | 2 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| e | 4 | |
| i | 4 | |
| t | 3 | |
| s | 2 | 7.1% |
| f | 2 | 7.1% |
| g | 2 | 7.1% |
| n | 2 | 7.1% |
| l | 2 | 7.1% |
| y | 1 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 4 | |
| I | 1 | 16.7% |
| C | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34 | |
| Common | 9 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 6 | |
| R | 4 | |
| e | 4 | |
| i | 4 | |
| t | 3 | |
| s | 2 | 5.9% |
| f | 2 | 5.9% |
| g | 2 | 5.9% |
| n | 2 | 5.9% |
| l | 2 | 5.9% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| (space) | 7 | |
| 7 | 2 | 22.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 7 | |
| o | 6 | |
| R | 4 | |
| e | 4 | |
| i | 4 | |
| t | 3 | |
| s | 2 | 4.7% |
| f | 2 | 4.7% |
| g | 2 | 4.7% |
| n | 2 | 4.7% |
| Other values (5) | 7 |
datasetKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 35.99993493 |
| Min length | 14 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
|---|---|
| 2nd row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
| 3rd row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
| 4th row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
| 5th row | 26098c25-8f7f-4c71-97ac-1d3db181c65e |
| Value | Count | Frequency (%) |
| 26098c25-8f7f-4c71-97ac-1d3db181c65e | 338089 | |
| phl.36.21.66_1 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1352358 | |
| - | 1352356 | |
| c | 1352356 | |
| 7 | 1014267 | 8.3% |
| 8 | 1014267 | 8.3% |
| 6 | 676181 | 5.6% |
| 2 | 676179 | 5.6% |
| 5 | 676178 | 5.6% |
| f | 676178 | 5.6% |
| 9 | 676178 | 5.6% |
| Other values (12) | 2704720 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7099876 | |
| Lowercase Letter | 3718979 | |
| Dash Punctuation | 1352356 | 11.1% |
| Other Punctuation | 3 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1352358 | |
| 7 | 1014267 | |
| 8 | 1014267 | |
| 6 | 676181 | |
| 2 | 676179 | |
| 5 | 676178 | |
| 9 | 676178 | |
| 3 | 338090 | 4.8% |
| 4 | 338089 | 4.8% |
| 0 | 338089 | 4.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 1352356 | |
| f | 676178 | |
| d | 676178 | |
| a | 338089 | 9.1% |
| b | 338089 | 9.1% |
| e | 338089 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| H | 1 | |
| L | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1352356 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8452236 | |
| Latin | 3718982 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1352358 | |
| - | 1352356 | |
| 7 | 1014267 | |
| 8 | 1014267 | |
| 6 | 676181 | |
| 2 | 676179 | |
| 5 | 676178 | |
| 9 | 676178 | |
| 3 | 338090 | 4.0% |
| 4 | 338089 | 4.0% |
| Other values (3) | 338093 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| c | 1352356 | |
| f | 676178 | |
| d | 676178 | |
| a | 338089 | 9.1% |
| b | 338089 | 9.1% |
| e | 338089 | 9.1% |
| P | 1 | < 0.1% |
| H | 1 | < 0.1% |
| L | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12171218 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1352358 | |
| - | 1352356 | |
| c | 1352356 | |
| 7 | 1014267 | 8.3% |
| 8 | 1014267 | 8.3% |
| 6 | 676181 | 5.6% |
| 2 | 676179 | 5.6% |
| 5 | 676178 | 5.6% |
| f | 676178 | 5.6% |
| 9 | 676178 | 5.6% |
| Other values (12) | 2704720 |
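The length and character totals reported for datasetKey can be cross-checked with a little arithmetic. The sketch below is only a sanity check and assumes that the 338089 UUID rows are 36 characters each and that the single stray value `phl.36.21.66_1` is 14 characters long (the reported minimum length); the variable names are illustrative.

```python
# Counts taken from the datasetKey tables above.
uuid_rows = 338_089   # rows holding the 36-character dataset UUID
stray_rows = 1        # the single 14-character value "phl.36.21.66_1"

# Total characters should match the "ASCII" block count.
total_chars = uuid_rows * 36 + stray_rows * 14

# Mean length over all non-missing rows should match "Mean length".
mean_length = total_chars / (uuid_rows + stray_rows)

print(total_chars)            # 12171218
print(round(mean_length, 8))
```

The total agrees with the ASCII block count of 12171218, and the mean agrees with the reported 35.99993493 to the displayed precision, so the one non-UUID value appears to account for the whole deviation from a constant length of 36.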
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 2 |
| Mean length | 2.000020705 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 338089 | |
| kahirupan | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 338089 | |
| S | 338089 | |
| a | 2 | < 0.1% |
| K | 1 | < 0.1% |
| h | 1 | < 0.1% |
| i | 1 | < 0.1% |
| r | 1 | < 0.1% |
| u | 1 | < 0.1% |
| p | 1 | < 0.1% |
| n | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 676179 | |
| Lowercase Letter | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 1 | |
| i | 1 | |
| r | 1 | |
| u | 1 | |
| p | 1 | |
| n | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 338089 | |
| S | 338089 | |
| K | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 676187 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 338089 | |
| S | 338089 | |
| a | 2 | < 0.1% |
| K | 1 | < 0.1% |
| h | 1 | < 0.1% |
| i | 1 | < 0.1% |
| r | 1 | < 0.1% |
| u | 1 | < 0.1% |
| p | 1 | < 0.1% |
| n | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 676187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 338089 | |
| S | 338089 | |
| a | 2 | < 0.1% |
| K | 1 | < 0.1% |
| h | 1 | < 0.1% |
| i | 1 | < 0.1% |
| r | 1 | < 0.1% |
| u | 1 | < 0.1% |
| p | 1 | < 0.1% |
| n | 1 | < 0.1% |
lastInterpreted
Text
| Distinct | 31889 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.9958118 |
| Min length | 2 |
Unique
| Unique | 3501 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 2024-12-01T12:07:01.240Z |
|---|---|
| 2nd row | 2024-12-01T12:07:01.438Z |
| 3rd row | 2024-12-01T12:07:01.443Z |
| 4th row | 2024-12-01T12:07:01.449Z |
| 5th row | 2024-12-01T12:07:01.465Z |
| Value | Count | Frequency (%) |
| 2024-12-01t12:07:38.532z | 73 | < 0.1% |
| 2024-12-01t12:07:38.533z | 71 | < 0.1% |
| 2024-12-01t12:07:38.508z | 68 | < 0.1% |
| 2024-12-01t12:07:39.879z | 67 | < 0.1% |
| 2024-12-01t12:07:37.936z | 65 | < 0.1% |
| 2024-12-01t12:07:40.339z | 65 | < 0.1% |
| 2024-12-01t12:07:39.819z | 65 | < 0.1% |
| 2024-12-01t12:07:39.723z | 64 | < 0.1% |
| 2024-12-01t12:07:39.875z | 64 | < 0.1% |
| 2024-12-01t12:07:38.854z | 63 | < 0.1% |
| Other values (31879) | 337428 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1562001 | |
| 1 | 1172147 | |
| 0 | 1166550 | |
| - | 676178 | |
| : | 676178 | |
| 4 | 503489 | 6.2% |
| 7 | 475460 | 5.9% |
| Z | 338089 | 4.2% |
| T | 338089 | 4.2% |
| . | 337757 | 4.2% |
| Other values (9) | 866878 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5746517 | |
| Other Punctuation | 1013935 | 12.5% |
| Uppercase Letter | 676186 | 8.3% |
| Dash Punctuation | 676178 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1562001 | |
| 1 | 1172147 | |
| 0 | 1166550 | |
| 4 | 503489 | 8.8% |
| 7 | 475460 | 8.3% |
| 3 | 314692 | 5.5% |
| 8 | 145833 | 2.5% |
| 9 | 145808 | 2.5% |
| 6 | 137824 | 2.4% |
| 5 | 122713 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 338089 | |
| T | 338089 | |
| N | 3 | < 0.1% |
| E | 3 | < 0.1% |
| L | 1 | < 0.1% |
| C | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 676178 | |
| . | 337757 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 676178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7436630 | |
| Latin | 676186 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1562001 | |
| 1 | 1172147 | |
| 0 | 1166550 | |
| - | 676178 | |
| : | 676178 | |
| 4 | 503489 | 6.8% |
| 7 | 475460 | 6.4% |
| . | 337757 | 4.5% |
| 3 | 314692 | 4.2% |
| 8 | 145833 | 2.0% |
| Other values (3) | 406345 | 5.5% |
Latin
| Value | Count | Frequency (%) |
| Z | 338089 | |
| T | 338089 | |
| N | 3 | < 0.1% |
| E | 3 | < 0.1% |
| L | 1 | < 0.1% |
| C | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8112816 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1562001 | |
| 1 | 1172147 | |
| 0 | 1166550 | |
| - | 676178 | |
| : | 676178 | |
| 4 | 503489 | 6.2% |
| 7 | 475460 | 5.9% |
| Z | 338089 | 4.2% |
| T | 338089 | 4.2% |
| . | 337757 | 4.2% |
| Other values (9) | 866878 |
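lastInterpreted is profiled as Text, but nearly all values look like ISO 8601 UTC timestamps. The character counts hint at two irregularities: there are fewer `.` characters (337757) than `Z` characters (338089), which suggests that some timestamps carry no fractional seconds, and the minimum length of 2 suggests a few values that are not timestamps at all. A minimal parsing sketch under those assumptions follows; `parse_last_interpreted` is a hypothetical helper name, not part of any library.

```python
from datetime import datetime, timezone
from typing import Optional

# Formats assumed from the sample rows: with and without fractional seconds.
_FORMATS = ("%Y-%m-%dT%H:%M:%S.%fZ", "%Y-%m-%dT%H:%M:%SZ")

def parse_last_interpreted(value: Optional[str]) -> Optional[datetime]:
    """Return a UTC datetime, or None if the value is not a timestamp."""
    if not value:
        return None
    for fmt in _FORMATS:
        try:
            return datetime.strptime(value, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            continue  # try the next format
    return None

print(parse_last_interpreted("2024-12-01T12:07:01.240Z"))
print(parse_last_interpreted("US"))  # not a timestamp
```

Returning `None` rather than raising keeps the stray short values from aborting a bulk conversion; they can then be counted and inspected separately.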
elevation
Text
Missing 
| Distinct | 2734 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 248950 |
| Missing (%) | 73.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.182614646 |
| Min length | 3 |
Unique
| Unique | 260 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 1524.0 |
|---|---|
| 2nd row | 2700.0 |
| 3rd row | 1800.0 |
| 4th row | 1000.0 |
| 5th row | 760.0 |
| Value | Count | Frequency (%) |
| 5.0 | 1545 | 1.7% |
| 1100.0 | 1195 | 1.3% |
| 1200.0 | 995 | 1.1% |
| 150.0 | 967 | 1.1% |
| 200.0 | 779 | 0.9% |
| 1829.0 | 757 | 0.8% |
| 50.0 | 700 | 0.8% |
| 300.0 | 694 | 0.8% |
| 1487.0 | 632 | 0.7% |
| 100.0 | 612 | 0.7% |
| Other values (2721) | 80268 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 137608 | |
| . | 89144 | |
| 1 | 52547 | 11.4% |
| 2 | 36028 | 7.8% |
| 5 | 32914 | 7.1% |
| 3 | 23010 | 5.0% |
| 4 | 21188 | 4.6% |
| 7 | 19318 | 4.2% |
| 6 | 17558 | 3.8% |
| 8 | 17037 | 3.7% |
| Other values (2) | 15647 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 372843 | |
| Other Punctuation | 89144 | 19.3% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 137608 | |
| 1 | 52547 | 14.1% |
| 2 | 36028 | 9.7% |
| 5 | 32914 | 8.8% |
| 3 | 23010 | 6.2% |
| 4 | 21188 | 5.7% |
| 7 | 19318 | 5.2% |
| 6 | 17558 | 4.7% |
| 8 | 17037 | 4.6% |
| 9 | 15635 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 89144 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 461999 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 137608 | |
| . | 89144 | |
| 1 | 52547 | 11.4% |
| 2 | 36028 | 7.8% |
| 5 | 32914 | 7.1% |
| 3 | 23010 | 5.0% |
| 4 | 21188 | 4.6% |
| 7 | 19318 | 4.2% |
| 6 | 17558 | 3.8% |
| 8 | 17037 | 3.7% |
| Other values (2) | 15647 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 461999 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 137608 | |
| . | 89144 | |
| 1 | 52547 | 11.4% |
| 2 | 36028 | 7.8% |
| 5 | 32914 | 7.1% |
| 3 | 23010 | 5.0% |
| 4 | 21188 | 4.6% |
| 7 | 19318 | 4.2% |
| 6 | 17558 | 3.8% |
| 8 | 17037 | 3.7% |
| Other values (2) | 15647 | 3.4% |
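Every variable in this report is typed as Text, so numeric columns such as elevation need an explicit conversion before any numeric summary is possible. The 12 dash characters counted above suggest that a few values are negative. The sketch below is one possible conversion, assuming plain decimal strings; `to_float` is a hypothetical helper name.

```python
import math
from typing import Optional

def to_float(value: Optional[str]) -> Optional[float]:
    """Convert a text cell to a finite float, or None if it is not numeric."""
    if value is None or not value.strip():
        return None
    try:
        number = float(value)
    except ValueError:
        return None  # non-numeric text, e.g. a misplaced label
    # float() also accepts "nan" and "inf"; treat those as missing.
    return number if math.isfinite(number) else None

print(to_float("1524.0"))  # 1524.0
print(to_float("-5.0"))    # -5.0
print(to_float("Panama"))  # None
```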
Missing 
| Distinct | 161 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 284393 |
| Missing (%) | 84.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.090705946 |
| Min length | 3 |
Unique
| Unique | 10 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 47745 | |
| 2.5 | 1013 | 1.9% |
| 25.0 | 410 | 0.8% |
| 100.0 | 392 | 0.7% |
| 1.0 | 356 | 0.7% |
| 5.0 | 307 | 0.6% |
| 12.5 | 218 | 0.4% |
| 50.0 | 212 | 0.4% |
| 38.75 | 206 | 0.4% |
| 30.5 | 167 | 0.3% |
| Other values (151) | 2675 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 100330 | |
| . | 53701 | |
| 5 | 4596 | 2.8% |
| 2 | 2430 | 1.5% |
| 1 | 2117 | 1.3% |
| 7 | 818 | 0.5% |
| 3 | 727 | 0.4% |
| 8 | 446 | 0.3% |
| 4 | 336 | 0.2% |
| 9 | 237 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 112273 | |
| Other Punctuation | 53701 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 100330 | |
| 5 | 4596 | 4.1% |
| 2 | 2430 | 2.2% |
| 1 | 2117 | 1.9% |
| 7 | 818 | 0.7% |
| 3 | 727 | 0.6% |
| 8 | 446 | 0.4% |
| 4 | 336 | 0.3% |
| 9 | 237 | 0.2% |
| 6 | 236 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 53701 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 165974 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 100330 | |
| . | 53701 | |
| 5 | 4596 | 2.8% |
| 2 | 2430 | 1.5% |
| 1 | 2117 | 1.3% |
| 7 | 818 | 0.5% |
| 3 | 727 | 0.4% |
| 8 | 446 | 0.3% |
| 4 | 336 | 0.2% |
| 9 | 237 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 165974 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 100330 | |
| . | 53701 | |
| 5 | 4596 | 2.8% |
| 2 | 2430 | 1.5% |
| 1 | 2117 | 1.3% |
| 7 | 818 | 0.5% |
| 3 | 727 | 0.4% |
| 8 | 446 | 0.3% |
| 4 | 336 | 0.2% |
| 9 | 237 | 0.1% |
depth
Text
Missing 
| Distinct | 2253 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 262666 |
| Missing (%) | 77.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 4.257742483 |
| Min length | 3 |
Unique
| Unique | 642 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 1785.34 |
|---|---|
| 2nd row | 15.0 |
| 3rd row | 49.0 |
| 4th row | 30.0 |
| 5th row | 3456.48 |
| Value | Count | Frequency (%) |
| 0.5 | 5165 | 6.8% |
| 3.0 | 4287 | 5.7% |
| 1.5 | 4042 | 5.4% |
| 1.0 | 3416 | 4.5% |
| 2.0 | 1940 | 2.6% |
| 10.0 | 1543 | 2.0% |
| 15.0 | 1120 | 1.5% |
| 0.0 | 937 | 1.2% |
| 17.5 | 933 | 1.2% |
| 2.5 | 888 | 1.2% |
| Other values (2243) | 51157 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 75428 | |
| 0 | 60955 | |
| 5 | 42952 | |
| 1 | 38545 | |
| 2 | 28079 | 8.7% |
| 3 | 18294 | 5.7% |
| 7 | 13595 | 4.2% |
| 6 | 12603 | 3.9% |
| 4 | 10944 | 3.4% |
| 8 | 10252 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 245725 | |
| Other Punctuation | 75428 | 23.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 60955 | |
| 5 | 42952 | |
| 1 | 38545 | |
| 2 | 28079 | |
| 3 | 18294 | 7.4% |
| 7 | 13595 | 5.5% |
| 6 | 12603 | 5.1% |
| 4 | 10944 | 4.5% |
| 8 | 10252 | 4.2% |
| 9 | 9506 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 75428 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 321153 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 75428 | |
| 0 | 60955 | |
| 5 | 42952 | |
| 1 | 38545 | |
| 2 | 28079 | 8.7% |
| 3 | 18294 | 5.7% |
| 7 | 13595 | 4.2% |
| 6 | 12603 | 3.9% |
| 4 | 10944 | 3.4% |
| 8 | 10252 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 321153 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 75428 | |
| 0 | 60955 | |
| 5 | 42952 | |
| 1 | 38545 | |
| 2 | 28079 | 8.7% |
| 3 | 18294 | 5.7% |
| 7 | 13595 | 4.2% |
| 6 | 12603 | 3.9% |
| 4 | 10944 | 3.4% |
| 8 | 10252 | 3.2% |
depthAccuracy
Text
Missing 
| Distinct | 239 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 272182 |
| Missing (%) | 80.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 3 |
| Mean length | 3.292753975 |
| Min length | 3 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 49.0 |
| 4th row | 5.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 26153 | |
| 0.5 | 5488 | 8.3% |
| 1.5 | 4808 | 7.3% |
| 1.0 | 2977 | 4.5% |
| 2.0 | 2346 | 3.6% |
| 2.5 | 2341 | 3.6% |
| 3.0 | 1911 | 2.9% |
| 0.25 | 1393 | 2.1% |
| 5.0 | 1251 | 1.9% |
| 4.0 | 1139 | 1.7% |
| Other values (229) | 16105 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 81443 | |
| . | 65912 | |
| 5 | 27649 | 12.7% |
| 1 | 12282 | 5.7% |
| 2 | 8722 | 4.0% |
| 3 | 4835 | 2.2% |
| 4 | 4807 | 2.2% |
| 7 | 4279 | 2.0% |
| 9 | 3447 | 1.6% |
| 6 | 2344 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 151120 | |
| Other Punctuation | 65912 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 81443 | |
| 5 | 27649 | 18.3% |
| 1 | 12282 | 8.1% |
| 2 | 8722 | 5.8% |
| 3 | 4835 | 3.2% |
| 4 | 4807 | 3.2% |
| 7 | 4279 | 2.8% |
| 9 | 3447 | 2.3% |
| 6 | 2344 | 1.6% |
| 8 | 1312 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 65912 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 217032 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 81443 | |
| . | 65912 | |
| 5 | 27649 | 12.7% |
| 1 | 12282 | 5.7% |
| 2 | 8722 | 4.0% |
| 3 | 4835 | 2.2% |
| 4 | 4807 | 2.2% |
| 7 | 4279 | 2.0% |
| 9 | 3447 | 1.6% |
| 6 | 2344 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 217032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 81443 | |
| . | 65912 | |
| 5 | 27649 | 12.7% |
| 1 | 12282 | 5.7% |
| 2 | 8722 | 4.0% |
| 3 | 4835 | 2.2% |
| 4 | 4807 | 2.2% |
| 7 | 4279 | 2.0% |
| 9 | 3447 | 1.6% |
| 6 | 2344 | 1.1% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 170 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 335404 |
| Missing (%) | 99.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.28847584 |
| Min length | 3 |
Unique
| Unique | 27 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 2259.882955420656 |
|---|---|
| 2nd row | 4272.023262908208 |
| 3rd row | 3797.0726080073355 |
| 4th row | 4837.263464897006 |
| 5th row | 1142.6033371248081 |
| Value | Count | Frequency (%) |
| 3997.886559051776 | 239 | 8.9% |
| 818.1211019658687 | 183 | 6.8% |
| 918.1358064728217 | 159 | 5.9% |
| 3435.2993691323722 | 143 | 5.3% |
| 1914.9010623948639 | 138 | 5.1% |
| 4049.579332802943 | 132 | 4.9% |
| 3247.910831883673 | 94 | 3.5% |
| 3286.3383926848273 | 91 | 3.4% |
| 2259.882955420656 | 89 | 3.3% |
| 3868.839758506256 | 70 | 2.6% |
| Other values (160) | 1352 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 5502 | |
| 8 | 5097 | |
| 2 | 4742 | |
| 9 | 4620 | |
| 1 | 4245 | |
| 6 | 4155 | |
| 7 | 4119 | |
| 5 | 3972 | |
| 4 | 3880 | |
| 0 | 3484 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 43816 | |
| Other Punctuation | 2690 | 5.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 5502 | |
| 8 | 5097 | |
| 2 | 4742 | |
| 9 | 4620 | |
| 1 | 4245 | |
| 6 | 4155 | |
| 7 | 4119 | |
| 5 | 3972 | |
| 4 | 3880 | |
| 0 | 3484 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2690 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46506 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 5502 | |
| 8 | 5097 | |
| 2 | 4742 | |
| 9 | 4620 | |
| 1 | 4245 | |
| 6 | 4155 | |
| 7 | 4119 | |
| 5 | 3972 | |
| 4 | 3880 | |
| 0 | 3484 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46506 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 5502 | |
| 8 | 5097 | |
| 2 | 4742 | |
| 9 | 4620 | |
| 1 | 4245 | |
| 6 | 4155 | |
| 7 | 4119 | |
| 5 | 3972 | |
| 4 | 3880 | |
| 0 | 3484 |
issue
Text
Missing 
| Distinct | 172 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 45626 |
| Missing (%) | 13.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 197 |
|---|---|
| Median length | 154 |
| Mean length | 59.83685737 |
| Min length | 15 |
Unique
| Unique | 18 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES |
|---|---|
| 2nd row | GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID |
| 3rd row | GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
| 4th row | GEODETIC_DATUM_ASSUMED_WGS84 |
| 5th row | GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES |
| Value | Count | Frequency (%) |
| geodetic_datum_assumed_wgs84;continent_derived_from_coordinates | 81966 | |
| geodetic_datum_assumed_wgs84 | 50425 | |
| geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 48259 | |
| geodetic_datum_assumed_wgs84;continent_invalid | 23627 | 8.1% |
| continent_derived_from_country | 13936 | 4.8% |
| geodetic_datum_assumed_wgs84;geodetic_datum_invalid;continent_derived_from_coordinates;continent_invalid | 12417 | 4.2% |
| geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;taxon_match_higherrank | 10213 | 3.5% |
| continent_derived_from_country;continent_invalid | 6285 | 2.1% |
| geodetic_datum_assumed_wgs84;geodetic_datum_invalid;continent_derived_from_coordinates | 4317 | 1.5% |
| country_derived_from_coordinates;geodetic_datum_assumed_wgs84;continent_invalid | 3977 | 1.4% |
| Other values (162) | 37046 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1740521 | 9.9% |
| _ | 1614935 | 9.2% |
| D | 1532749 | 8.8% |
| T | 1465021 | 8.4% |
| N | 1338780 | 7.7% |
| I | 1261875 | 7.2% |
| O | 1227170 | 7.0% |
| S | 956544 | 5.5% |
| A | 942202 | 5.4% |
| C | 851797 | 4.9% |
| Other values (18) | 4568772 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15039613 | |
| Connector Punctuation | 1614935 | 9.2% |
| Decimal Number | 509914 | 2.9% |
| Other Punctuation | 335904 | 1.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1740521 | |
| D | 1532749 | |
| T | 1465021 | |
| N | 1338780 | |
| I | 1261875 | |
| O | 1227170 | |
| S | 956544 | 6.4% |
| A | 942202 | 6.3% |
| C | 851797 | 5.7% |
| M | 778930 | 5.2% |
| Other values (14) | 2944024 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 254957 | |
| 8 | 254957 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1614935 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 335904 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15039613 | |
| Common | 2460753 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1740521 | |
| D | 1532749 | |
| T | 1465021 | |
| N | 1338780 | |
| I | 1261875 | |
| O | 1227170 | |
| S | 956544 | 6.4% |
| A | 942202 | 6.3% |
| C | 851797 | 5.7% |
| M | 778930 | 5.2% |
| Other values (14) | 2944024 |
Common
| Value | Count | Frequency (%) |
| _ | 1614935 | |
| ; | 335904 | 13.7% |
| 4 | 254957 | 10.4% |
| 8 | 254957 | 10.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17500366 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1740521 | 9.9% |
| _ | 1614935 | 9.2% |
| D | 1532749 | 8.8% |
| T | 1465021 | 8.4% |
| N | 1338780 | 7.7% |
| I | 1261875 | 7.2% |
| O | 1227170 | 7.0% |
| S | 956544 | 5.5% |
| A | 942202 | 5.4% |
| C | 851797 | 4.9% |
| Other values (18) | 4568772 |
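The issue values are semicolon-delimited lists of flags, so the 172 distinct strings counted above are combinations rather than individual flags. Counting each flag separately is often more informative. The sketch below shows one way to do that on the five sample rows listed above; `count_issue_flags` is a hypothetical helper name.

```python
from collections import Counter
from typing import Iterable, Optional

def count_issue_flags(values: Iterable[Optional[str]]) -> Counter:
    """Count individual flags across semicolon-delimited issue strings."""
    counts: Counter = Counter()
    for value in values:
        if not value:
            continue  # skip missing cells
        counts.update(flag for flag in value.split(";") if flag)
    return counts

# The five sample rows shown in the issue section.
sample_rows = [
    "GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES",
    "GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID",
    "GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID",
    "GEODETIC_DATUM_ASSUMED_WGS84",
    "GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES",
]

for flag, count in count_issue_flags(sample_rows).most_common():
    print(flag, count)
```

On these five rows the counts are 5 for `GEODETIC_DATUM_ASSUMED_WGS84`, 3 for `CONTINENT_DERIVED_FROM_COORDINATES`, and 2 for `CONTINENT_INVALID`.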
mediaType
Text
Missing 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 324090 |
| Missing (%) | 95.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 362 |
|---|---|
| Median length | 10 |
| Mean length | 13.91095401 |
| Min length | 10 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 11788 | |
| stillimage;stillimage | 1471 | 10.5% |
| stillimage;stillimage;stillimage | 245 | 1.7% |
| stillimage;stillimage;stillimage;stillimage | 193 | 1.4% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 97 | 0.7% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 75 | 0.5% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 45 | 0.3% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 18 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 16 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 12 | 0.1% |
| Other values (9) | 44 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 37966 | |
| S | 18983 | |
| t | 18983 | |
| i | 18983 | |
| I | 18983 | |
| m | 18983 | |
| a | 18983 | |
| g | 18983 | |
| e | 18983 | |
| ; | 4979 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 151864 | |
| Uppercase Letter | 37966 | 19.5% |
| Other Punctuation | 4979 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 37966 | |
| t | 18983 | |
| i | 18983 | |
| m | 18983 | |
| a | 18983 | |
| g | 18983 | |
| e | 18983 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 18983 | |
| I | 18983 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 4979 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 189830 | |
| Common | 4979 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 37966 | |
| S | 18983 | |
| t | 18983 | |
| i | 18983 | |
| I | 18983 | |
| m | 18983 | |
| a | 18983 | |
| g | 18983 | |
| e | 18983 |
Common
| Value | Count | Frequency (%) |
| ; | 4979 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 194809 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 37966 | |
| S | 18983 | |
| t | 18983 | |
| i | 18983 | |
| I | 18983 | |
| m | 18983 | |
| a | 18983 | |
| g | 18983 | |
| e | 18983 | |
| ; | 4979 | 2.6% |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.217271192 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 264632 | |
| false | 73457 | 21.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 338089 | |
| t | 264632 | |
| r | 264632 | |
| u | 264632 | |
| f | 73457 | 5.2% |
| a | 73457 | 5.2% |
| l | 73457 | 5.2% |
| s | 73457 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1425813 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 338089 | |
| t | 264632 | |
| r | 264632 | |
| u | 264632 | |
| f | 73457 | 5.2% |
| a | 73457 | 5.2% |
| l | 73457 | 5.2% |
| s | 73457 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1425813 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 338089 | |
| t | 264632 | |
| r | 264632 | |
| u | 264632 | |
| f | 73457 | 5.2% |
| a | 73457 | 5.2% |
| l | 73457 | 5.2% |
| s | 73457 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1425813 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 338089 | |
| t | 264632 | |
| r | 264632 | |
| u | 264632 | |
| f | 73457 | 5.2% |
| a | 73457 | 5.2% |
| l | 73457 | 5.2% |
| s | 73457 | 5.2% |
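hasCoordinate holds only the strings `true` and `false`, yet it is profiled as Text like every other variable. A small conversion to real booleans might look like the sketch below; `parse_bool` is a hypothetical helper name, and the choice to map anything else to `None` is an assumption.

```python
from typing import Optional

def parse_bool(value: Optional[str]) -> Optional[bool]:
    """Map 'true'/'false' text to a bool; anything else becomes None."""
    if value is None:
        return None
    return {"true": True, "false": False}.get(value.strip().lower())

print(parse_bool("true"))     # True
print(parse_bool("false"))    # False
print(parse_bool("PAN.5_1"))  # None
```

Mapping unexpected text to `None` instead of `False` avoids silently counting misplaced values as records without coordinates.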
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.992439861 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 335533 | |
| true | 2556 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 338089 | |
| f | 335533 | |
| a | 335533 | |
| l | 335533 | |
| s | 335533 | |
| t | 2556 | 0.2% |
| r | 2556 | 0.2% |
| u | 2556 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1687889 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 338089 | |
| f | 335533 | |
| a | 335533 | |
| l | 335533 | |
| s | 335533 | |
| t | 2556 | 0.2% |
| r | 2556 | 0.2% |
| u | 2556 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1687889 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 338089 | |
| f | 335533 | |
| a | 335533 | |
| l | 335533 | |
| s | 335533 | |
| t | 2556 | 0.2% |
| r | 2556 | 0.2% |
| u | 2556 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1687889 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 338089 | |
| f | 335533 | |
| a | 335533 | |
| l | 335533 | |
| s | 335533 | |
| t | 2556 | 0.2% |
| r | 2556 | 0.2% |
| u | 2556 | 0.2% |
taxonKey
Text
| Distinct | 45746 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.660985717 |
| Min length | 1 |
Unique
| Unique | 9824 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | 10583418 |
|---|---|
| 2nd row | 5854277 |
| 3rd row | 5771 |
| 4th row | 4479 |
| 5th row | 2651085 |
| Value | Count | Frequency (%) |
| 0 | 6107 | 1.8% |
| 6841 | 3252 | 1.0% |
| 637 | 2297 | 0.7% |
| 2285664 | 2008 | 0.6% |
| 2329589 | 2006 | 0.6% |
| 2440447 | 1919 | 0.6% |
| 8324617 | 1660 | 0.5% |
| 7971837 | 1474 | 0.4% |
| 2431491 | 1334 | 0.4% |
| 2307333 | 1035 | 0.3% |
| Other values (45736) | 314997 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 349816 | |
| 1 | 269425 | |
| 4 | 234301 | |
| 3 | 228411 | |
| 7 | 210859 | |
| 5 | 205314 | |
| 8 | 200618 | |
| 9 | 192840 | |
| 0 | 183611 | |
| 6 | 176811 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2252006 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 349816 | |
| 1 | 269425 | |
| 4 | 234301 | |
| 3 | 228411 | |
| 7 | 210859 | |
| 5 | 205314 | |
| 8 | 200618 | |
| 9 | 192840 | |
| 0 | 183611 | |
| 6 | 176811 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2252006 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 349816 | |
| 1 | 269425 | |
| 4 | 234301 | |
| 3 | 228411 | |
| 7 | 210859 | |
| 5 | 205314 | |
| 8 | 200618 | |
| 9 | 192840 | |
| 0 | 183611 | |
| 6 | 176811 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2252006 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 349816 | |
| 1 | 269425 | |
| 4 | 234301 | |
| 3 | 228411 | |
| 7 | 210859 | |
| 5 | 205314 | |
| 8 | 200618 | |
| 9 | 192840 | |
| 0 | 183611 | |
| 6 | 176811 |
acceptedTaxonKey
Text
Missing 
| Distinct | 44951 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 6112 |
| Missing (%) | 1.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.770436349 |
| Min length | 1 |
Unique
| Unique | 9417 |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | 10583418 |
|---|---|
| 2nd row | 5854277 |
| 3rd row | 5771 |
| 4th row | 4479 |
| 5th row | 2651085 |
| Value | Count | Frequency (%) |
| 6841 | 3252 | 1.0% |
| 637 | 2297 | 0.7% |
| 2285664 | 2008 | 0.6% |
| 2329589 | 2006 | 0.6% |
| 2440447 | 1919 | 0.6% |
| 8324617 | 1660 | 0.5% |
| 8770992 | 1474 | 0.4% |
| 2431491 | 1334 | 0.4% |
| 2307333 | 1035 | 0.3% |
| 68 | 875 | 0.3% |
| Other values (44941) | 314122 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 335747 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207519 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2247663 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 335747 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207519 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2247663 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 335747 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207519 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2247663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 335747 | |
| 1 | 273500 | |
| 4 | 233904 | |
| 3 | 226967 | |
| 5 | 207519 | |
| 7 | 206909 | |
| 8 | 206081 | |
| 9 | 198048 | |
| 0 | 180588 | |
| 6 | 178400 |
kingdomKey
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 1 | 291926 | 86.3% |
| 6 | 35530 | 10.5% |
| 0 | 6107 | 1.8% |
| 4 | 3038 | 0.9% |
| 3 | 1166 | 0.3% |
| 5 | 322 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 291926 | |
| 6 | 35530 | 10.5% |
| 0 | 6107 | 1.8% |
| 4 | 3038 | 0.9% |
| 3 | 1166 | 0.3% |
| 5 | 322 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 338089 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 291926 | |
| 6 | 35530 | 10.5% |
| 0 | 6107 | 1.8% |
| 4 | 3038 | 0.9% |
| 3 | 1166 | 0.3% |
| 5 | 322 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 338089 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 291926 | |
| 6 | 35530 | 10.5% |
| 0 | 6107 | 1.8% |
| 4 | 3038 | 0.9% |
| 3 | 1166 | 0.3% |
| 5 | 322 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 338089 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 291926 | |
| 6 | 35530 | 10.5% |
| 0 | 6107 | 1.8% |
| 4 | 3038 | 0.9% |
| 3 | 1166 | 0.3% |
| 5 | 322 | 0.1% |
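As a cross-check on the kingdomKey tables above, the value counts can be summed and converted back into percentages. The sketch below is illustrative only: the counts are copied from the value table, the total of 338094 observations comes from the report overview, and the variable names are made up for this example. It assumes each percentage is taken over the non-missing rows, which is consistent with the figures shown but is not stated anywhere in the report.

```python
# Counts copied from the kingdomKey value table above.
kingdom_counts = {"1": 291926, "6": 35530, "0": 6107, "4": 3038, "3": 1166, "5": 322}

total_observations = 338094  # from the report overview
missing = 5                  # from the kingdomKey summary

non_missing = sum(kingdom_counts.values())
# The value counts should account for every non-missing row.
assert non_missing == total_observations - missing

# Assumption: percentage = count / non-missing rows, rounded to one decimal.
frequencies = {k: round(100 * c / non_missing, 1) for k, c in kingdom_counts.items()}
for key, pct in frequencies.items():
    print(f"{key}: {pct}%")
```

Under that assumption the five percentages printed in the table (10.5%, 1.8%, 0.9%, 0.3%, 0.1%) are reproduced, and the share for value 1, which the table leaves blank, comes out at 86.3%.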
phylumKey
Text
Missing 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6812 |
| Missing (%) | 2.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.519107588 |
| Min length | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 54 |
|---|---|
| 2nd row | 42 |
| 3rd row | 42 |
| 4th row | 54 |
| 5th row | 7707728 |
| Value | Count | Frequency (%) |
| 54 | 145971 | |
| 44 | 103372 | |
| 7707728 | 30584 | 9.2% |
| 52 | 20737 | 6.3% |
| 42 | 11327 | 3.4% |
| 43 | 3177 | 1.0% |
| 106 | 2942 | 0.9% |
| 8770992 | 2110 | 0.6% |
| 50 | 1630 | 0.5% |
| 36 | 1622 | 0.5% |
| Other values (30) | 7810 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 368289 | |
| 5 | 171646 | |
| 7 | 127682 | 15.3% |
| 2 | 65080 | 7.8% |
| 0 | 39492 | 4.7% |
| 8 | 36207 | 4.3% |
| 6 | 7247 | 0.9% |
| 3 | 7107 | 0.9% |
| 1 | 5946 | 0.7% |
| 9 | 5839 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 834535 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 368289 | |
| 5 | 171646 | |
| 7 | 127682 | 15.3% |
| 2 | 65080 | 7.8% |
| 0 | 39492 | 4.7% |
| 8 | 36207 | 4.3% |
| 6 | 7247 | 0.9% |
| 3 | 7107 | 0.9% |
| 1 | 5946 | 0.7% |
| 9 | 5839 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 834535 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 368289 | |
| 5 | 171646 | |
| 7 | 127682 | 15.3% |
| 2 | 65080 | 7.8% |
| 0 | 39492 | 4.7% |
| 8 | 36207 | 4.3% |
| 6 | 7247 | 0.9% |
| 3 | 7107 | 0.9% |
| 1 | 5946 | 0.7% |
| 9 | 5839 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 834535 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 368289 | |
| 5 | 171646 | |
| 7 | 127682 | 15.3% |
| 2 | 65080 | 7.8% |
| 0 | 39492 | 4.7% |
| 8 | 36207 | 4.3% |
| 6 | 7247 | 0.9% |
| 3 | 7107 | 0.9% |
| 1 | 5946 | 0.7% |
| 9 | 5839 | 0.7% |
classKey
Text
Missing 
| Distinct | 105 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52277 |
| Missing (%) | 15.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.277555219 |
| Min length | 3 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 216 |
|---|---|
| 2nd row | 256 |
| 3rd row | 256 |
| 4th row | 229 |
| 5th row | 7228684 |
| Value | Count | Frequency (%) |
| 216 | 112951 | |
| 229 | 27895 | 9.8% |
| 359 | 24478 | 8.6% |
| 131 | 18384 | 6.4% |
| 220 | 15795 | 5.5% |
| 196 | 10876 | 3.8% |
| 256 | 10686 | 3.7% |
| 137 | 9771 | 3.4% |
| 225 | 9525 | 3.3% |
| 11592253 | 9481 | 3.3% |
| Other values (95) | 35975 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 285083 | |
| 1 | 210874 | |
| 6 | 146616 | |
| 9 | 79319 | 8.5% |
| 3 | 77102 | 8.2% |
| 5 | 74835 | 8.0% |
| 0 | 24095 | 2.6% |
| 7 | 19107 | 2.0% |
| 4 | 11458 | 1.2% |
| 8 | 8292 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 936781 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 285083 | |
| 1 | 210874 | |
| 6 | 146616 | |
| 9 | 79319 | 8.5% |
| 3 | 77102 | 8.2% |
| 5 | 74835 | 8.0% |
| 0 | 24095 | 2.6% |
| 7 | 19107 | 2.0% |
| 4 | 11458 | 1.2% |
| 8 | 8292 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 936781 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 285083 | |
| 1 | 210874 | |
| 6 | 146616 | |
| 9 | 79319 | 8.5% |
| 3 | 77102 | 8.2% |
| 5 | 74835 | 8.0% |
| 0 | 24095 | 2.6% |
| 7 | 19107 | 2.0% |
| 4 | 11458 | 1.2% |
| 8 | 8292 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 936781 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 285083 | |
| 1 | 210874 | |
| 6 | 146616 | |
| 9 | 79319 | 8.5% |
| 3 | 77102 | 8.2% |
| 5 | 74835 | 8.0% |
| 0 | 24095 | 2.6% |
| 7 | 19107 | 2.0% |
| 4 | 11458 | 1.2% |
| 8 | 8292 | 0.9% |
orderKey
Text
Missing 
| Distinct | 531 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 30347 |
| Missing (%) | 9.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.491471891 |
| Min length | 3 |
Unique
| Unique | 48 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 797 |
|---|---|
| 2nd row | 1080 |
| 3rd row | 864 |
| 4th row | 637 |
| 5th row | 392 |
| Value | Count | Frequency (%) |
| 797 | 79519 | |
| 587 | 25783 | 8.4% |
| 637 | 23755 | 7.7% |
| 1470 | 10132 | 3.3% |
| 952 | 10009 | 3.3% |
| 1457 | 8496 | 2.8% |
| 1459 | 8406 | 2.7% |
| 953 | 8204 | 2.7% |
| 1369 | 7858 | 2.6% |
| 733 | 7808 | 2.5% |
| Other values (521) | 117777 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 281408 | |
| 9 | 172836 | |
| 1 | 111891 | 10.4% |
| 3 | 100100 | 9.3% |
| 5 | 91866 | 8.5% |
| 4 | 74011 | 6.9% |
| 8 | 71573 | 6.7% |
| 0 | 64573 | 6.0% |
| 6 | 63725 | 5.9% |
| 2 | 42507 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1074490 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 281408 | |
| 9 | 172836 | |
| 1 | 111891 | 10.4% |
| 3 | 100100 | 9.3% |
| 5 | 91866 | 8.5% |
| 4 | 74011 | 6.9% |
| 8 | 71573 | 6.7% |
| 0 | 64573 | 6.0% |
| 6 | 63725 | 5.9% |
| 2 | 42507 | 4.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1074490 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 281408 | |
| 9 | 172836 | |
| 1 | 111891 | 10.4% |
| 3 | 100100 | 9.3% |
| 5 | 91866 | 8.5% |
| 4 | 74011 | 6.9% |
| 8 | 71573 | 6.7% |
| 0 | 64573 | 6.0% |
| 6 | 63725 | 5.9% |
| 2 | 42507 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1074490 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 281408 | |
| 9 | 172836 | |
| 1 | 111891 | 10.4% |
| 3 | 100100 | 9.3% |
| 5 | 91866 | 8.5% |
| 4 | 74011 | 6.9% |
| 8 | 71573 | 6.7% |
| 0 | 64573 | 6.0% |
| 6 | 63725 | 5.9% |
| 2 | 42507 | 4.0% |
familyKey
Text
Missing 
| Distinct | 3094 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 19910 |
| Missing (%) | 5.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.310185302 |
| Min length | 4 |
Unique
| Unique | 308 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 4705755 |
|---|---|
| 2nd row | 5854277 |
| 3rd row | 5771 |
| 4th row | 4479 |
| 5th row | 2373 |
| Value | Count | Frequency (%) |
| 4479 | 12102 | 3.8% |
| 6950 | 12012 | 3.8% |
| 7015 | 7500 | 2.4% |
| 5343 | 7246 | 2.3% |
| 6748 | 6784 | 2.1% |
| 3073 | 6677 | 2.1% |
| 5314 | 5540 | 1.7% |
| 4532185 | 5452 | 1.7% |
| 5854277 | 5009 | 1.6% |
| 6841 | 4930 | 1.5% |
| Other values (3084) | 244932 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 190928 | |
| 4 | 186306 | |
| 3 | 170146 | |
| 7 | 153207 | |
| 6 | 142449 | |
| 8 | 120810 | |
| 9 | 112349 | |
| 2 | 107749 | |
| 0 | 97175 | |
| 1 | 90313 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1371432 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 190928 | |
| 4 | 186306 | |
| 3 | 170146 | |
| 7 | 153207 | |
| 6 | 142449 | |
| 8 | 120810 | |
| 9 | 112349 | |
| 2 | 107749 | |
| 0 | 97175 | |
| 1 | 90313 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1371432 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 190928 | |
| 4 | 186306 | |
| 3 | 170146 | |
| 7 | 153207 | |
| 6 | 142449 | |
| 8 | 120810 | |
| 9 | 112349 | |
| 2 | 107749 | |
| 0 | 97175 | |
| 1 | 90313 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1371432 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 190928 | |
| 4 | 186306 | |
| 3 | 170146 | |
| 7 | 153207 | |
| 6 | 142449 | |
| 8 | 120810 | |
| 9 | 112349 | |
| 2 | 107749 | |
| 0 | 97175 | |
| 1 | 90313 |
genusKey
Text
Missing 
| Distinct | 19382 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 34396 |
| Missing (%) | 10.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.00830101 |
| Min length | 7 |
Unique
| Unique | 2161 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 4686308 |
|---|---|
| 2nd row | 2651085 |
| 3rd row | 1068113 |
| 4th row | 4609405 |
| 5th row | 2406899 |
| Value | Count | Frequency (%) |
| 2431477 | 4671 | 1.5% |
| 4646327 | 4236 | 1.4% |
| 2227127 | 3675 | 1.2% |
| 2285664 | 2587 | 0.9% |
| 2329589 | 2006 | 0.7% |
| 2440446 | 1919 | 0.6% |
| 2227317 | 1707 | 0.6% |
| 2440326 | 1662 | 0.5% |
| 4312471 | 1424 | 0.5% |
| 8782549 | 1420 | 0.5% |
| Other values (19372) | 278391 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 352255 | |
| 4 | 258075 | |
| 1 | 241765 | |
| 3 | 237700 | |
| 7 | 202632 | |
| 8 | 184073 | |
| 9 | 172673 | |
| 6 | 170699 | |
| 5 | 157110 | |
| 0 | 151425 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2128407 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 352255 | |
| 4 | 258075 | |
| 1 | 241765 | |
| 3 | 237700 | |
| 7 | 202632 | |
| 8 | 184073 | |
| 9 | 172673 | |
| 6 | 170699 | |
| 5 | 157110 | |
| 0 | 151425 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2128407 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 352255 | |
| 4 | 258075 | |
| 1 | 241765 | |
| 3 | 237700 | |
| 7 | 202632 | |
| 8 | 184073 | |
| 9 | 172673 | |
| 6 | 170699 | |
| 5 | 157110 | |
| 0 | 151425 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2128407 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 352255 | |
| 4 | 258075 | |
| 1 | 241765 | |
| 3 | 237700 | |
| 7 | 202632 | |
| 8 | 184073 | |
| 9 | 172673 | |
| 6 | 170699 | |
| 5 | 157110 | |
| 0 | 151425 |
speciesKey
Text
Missing 
| Distinct | 37032 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 89520 |
| Missing (%) | 26.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.043576561 |
| Min length | 7 |
Unique
| Unique | 7679 |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | 10583418 |
|---|---|
| 2nd row | 1068127 |
| 3rd row | 11155573 |
| 4th row | 2406900 |
| 5th row | 6788608 |
| Value | Count | Frequency (%) |
| 2440447 | 1919 | 0.8% |
| 8324617 | 1660 | 0.7% |
| 2431491 | 1334 | 0.5% |
| 2431423 | 815 | 0.3% |
| 2432006 | 764 | 0.3% |
| 2431513 | 639 | 0.3% |
| 5218985 | 601 | 0.2% |
| 9001095 | 579 | 0.2% |
| 4312492 | 579 | 0.2% |
| 2431543 | 566 | 0.2% |
| Other values (37022) | 239118 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 263968 | |
| 1 | 217498 | |
| 4 | 185797 | |
| 3 | 173534 | |
| 5 | 166110 | |
| 8 | 162114 | |
| 7 | 157335 | |
| 9 | 155062 | |
| 0 | 141342 | |
| 6 | 128090 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1750850 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 263968 | |
| 1 | 217498 | |
| 4 | 185797 | |
| 3 | 173534 | |
| 5 | 166110 | |
| 8 | 162114 | |
| 7 | 157335 | |
| 9 | 155062 | |
| 0 | 141342 | |
| 6 | 128090 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1750850 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 263968 | |
| 1 | 217498 | |
| 4 | 185797 | |
| 3 | 173534 | |
| 5 | 166110 | |
| 8 | 162114 | |
| 7 | 157335 | |
| 9 | 155062 | |
| 0 | 141342 | |
| 6 | 128090 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1750850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 263968 | |
| 1 | 217498 | |
| 4 | 185797 | |
| 3 | 173534 | |
| 5 | 166110 | |
| 8 | 162114 | |
| 7 | 157335 | |
| 9 | 155062 | |
| 0 | 141342 | |
| 6 | 128090 |
species
Text
Missing 
| Distinct | 37025 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 89520 |
| Missing (%) | 26.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 30 |
| Mean length | 19.21962474 |
| Min length | 8 |
Unique
| Unique | 7675 |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | Rectiostoma fernaldella |
|---|---|
| 2nd row | Mesontoplatys bolzi |
| 3rd row | Dulcerana granularis |
| 4th row | Amanses scopas |
| 5th row | Calyptogena extenta |
| Value | Count | Frequency (%) |
| plethodon | 4564 | 0.9% |
| faxonius | 4236 | 0.9% |
| procambarus | 3408 | 0.7% |
| truncatus | 1929 | 0.4% |
| tursiops | 1919 | 0.4% |
| cinereus | 1885 | 0.4% |
| delphis | 1660 | 0.3% |
| delphinus | 1660 | 0.3% |
| cambarus | 1583 | 0.3% |
| anolis | 1411 | 0.3% |
| Other values (38206) | 472895 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 541195 | 11.3% |
| i | 417200 | 8.7% |
| s | 361384 | 7.6% |
| e | 336874 | 7.1% |
| o | 316429 | 6.6% |
| r | 303496 | 6.4% |
| l | 260561 | 5.5% |
| (space) | 248576 | 5.2% |
| n | 248318 | 5.2% |
| u | 245629 | 5.1% |
| Other values (44) | 1497837 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4280184 | |
| Uppercase Letter | 248580 | 5.2% |
| Space Separator | 248576 | 5.2% |
| Dash Punctuation | 159 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 541195 | |
| i | 417200 | |
| s | 361384 | 8.4% |
| e | 336874 | 7.9% |
| o | 316429 | 7.4% |
| r | 303496 | 7.1% |
| l | 260561 | 6.1% |
| n | 248318 | 5.8% |
| u | 245629 | 5.7% |
| t | 241459 | 5.6% |
| Other values (16) | 1007639 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 39940 | |
| C | 30327 | |
| A | 26578 | |
| S | 20139 | 8.1% |
| M | 15848 | 6.4% |
| E | 14671 | 5.9% |
| L | 12774 | 5.1% |
| H | 12456 | 5.0% |
| T | 11171 | 4.5% |
| D | 11092 | 4.5% |
| Other values (16) | 53584 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 248576 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 159 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4528764 | |
| Common | 248735 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 541195 | |
| i | 417200 | 9.2% |
| s | 361384 | 8.0% |
| e | 336874 | 7.4% |
| o | 316429 | 7.0% |
| r | 303496 | 6.7% |
| l | 260561 | 5.8% |
| n | 248318 | 5.5% |
| u | 245629 | 5.4% |
| t | 241459 | 5.3% |
| Other values (42) | 1256219 |
Common
| Value | Count | Frequency (%) |
| (space) | 248576 | |
| - | 159 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4777499 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 541195 | 11.3% |
| i | 417200 | 8.7% |
| s | 361384 | 7.6% |
| e | 336874 | 7.1% |
| o | 316429 | 6.6% |
| r | 303496 | 6.4% |
| l | 260561 | 5.5% |
| (space) | 248576 | 5.2% |
| n | 248318 | 5.2% |
| u | 245629 | 5.1% |
| Other values (44) | 1497837 |
acceptedScientificName
Text
Missing
| Distinct | 44951 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 6112 |
| Missing (%) | 1.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 136 |
|---|---|
| Median length | 90 |
| Mean length | 31.13227826 |
| Min length | 5 |
Unique
| Unique | 9417 |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | Rectiostoma fernaldella |
|---|---|
| 2nd row | Siboglinidae |
| 3rd row | Amphinomidae |
| 4th row | Cambaridae |
| 5th row | Polystichum Roth |
| Value | Count | Frequency (%) |
| & | 37558 | 3.0% |
| linnaeus | 10225 | 0.8% |
| 1758 | 8032 | 0.6% |
| l | 6309 | 0.5% |
| 1985 | 5095 | 0.4% |
| plethodon | 4673 | 0.4% |
| walker | 4511 | 0.4% |
| jones | 4350 | 0.3% |
| faxonius | 4236 | 0.3% |
| procambarus | 3675 | 0.3% |
| Other values (49762) | 1162095 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 918777 | 8.9% |
| a | 834408 | 8.1% |
| e | 661550 | 6.4% |
| i | 629479 | 6.1% |
| r | 530385 | 5.1% |
| s | 523738 | 5.1% |
| o | 514156 | 5.0% |
| n | 463991 | 4.5% |
| l | 422188 | 4.1% |
| t | 361951 | 3.5% |
| Other values (97) | 4474733 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6989605 | |
| Decimal Number | 1045012 | 10.1% |
| Space Separator | 918777 | 8.9% |
| Uppercase Letter | 744301 | 7.2% |
| Other Punctuation | 380647 | 3.7% |
| Close Punctuation | 126253 | 1.2% |
| Open Punctuation | 126253 | 1.2% |
| Dash Punctuation | 4427 | < 0.1% |
| Math Symbol | 78 | < 0.1% |
| Connector Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 834408 | |
| e | 661550 | 9.5% |
| i | 629479 | 9.0% |
| r | 530385 | 7.6% |
| s | 523738 | 7.5% |
| o | 514156 | 7.4% |
| n | 463991 | 6.6% |
| l | 422188 | 6.0% |
| t | 361951 | 5.2% |
| u | 355648 | 5.1% |
| Other values (43) | 1692111 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 65471 | 8.8% |
| C | 64551 | 8.7% |
| B | 58203 | 7.8% |
| S | 57866 | 7.8% |
| L | 52865 | 7.1% |
| M | 50300 | 6.8% |
| H | 47242 | 6.3% |
| A | 47144 | 6.3% |
| G | 43860 | 5.9% |
| D | 38280 | 5.1% |
| Other values (24) | 218519 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 307284 | |
| 8 | 202570 | |
| 9 | 141944 | |
| 7 | 72354 | 6.9% |
| 2 | 62471 | 6.0% |
| 0 | 62196 | 6.0% |
| 5 | 60671 | 5.8% |
| 6 | 50352 | 4.8% |
| 3 | 46493 | 4.4% |
| 4 | 38677 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 266394 | |
| . | 76357 | 20.1% |
| & | 37558 | 9.9% |
| ' | 338 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 918777 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 126253 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 126253 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4427 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 78 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7733906 | |
| Common | 2601450 | 25.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 834408 | 10.8% |
| e | 661550 | 8.6% |
| i | 629479 | 8.1% |
| r | 530385 | 6.9% |
| s | 523738 | 6.8% |
| o | 514156 | 6.6% |
| n | 463991 | 6.0% |
| l | 422188 | 5.5% |
| t | 361951 | 4.7% |
| u | 355648 | 4.6% |
| Other values (77) | 2436412 |
Common
| Value | Count | Frequency (%) |
| (space) | 918777 | |
| 1 | 307284 | 11.8% |
| , | 266394 | 10.2% |
| 8 | 202570 | 7.8% |
| 9 | 141944 | 5.5% |
| ) | 126253 | 4.9% |
| ( | 126253 | 4.9% |
| . | 76357 | 2.9% |
| 7 | 72354 | 2.8% |
| 2 | 62471 | 2.4% |
| Other values (10) | 300793 | 11.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10311871 | |
| None | 23485 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 918777 | 8.9% |
| a | 834408 | 8.1% |
| e | 661550 | 6.4% |
| i | 629479 | 6.1% |
| r | 530385 | 5.1% |
| s | 523738 | 5.1% |
| o | 514156 | 5.0% |
| n | 463991 | 4.5% |
| l | 422188 | 4.1% |
| t | 361951 | 3.5% |
| Other values (61) | 4451248 |
None
| Value | Count | Frequency (%) |
| ü | 8130 | |
| é | 6105 | |
| è | 2347 | 10.0% |
| ö | 1932 | 8.2% |
| å | 1562 | 6.7% |
| ä | 831 | 3.5% |
| ó | 715 | 3.0% |
| á | 474 | 2.0% |
| ø | 310 | 1.3% |
| É | 261 | 1.1% |
| Other values (26) | 818 | 3.5% |
verbatimScientificName
Text
Missing
| Distinct | 46008 |
|---|---|
| Distinct (%) | 14.6% |
| Missing | 24039 |
| Missing (%) | 7.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 85 |
|---|---|
| Median length | 63 |
| Mean length | 18.57454904 |
| Min length | 3 |
Unique
| Unique | 10055 |
|---|---|
| Unique (%) | 3.2% |
Sample
| 1st row | Rectiostoma fernaldella |
|---|---|
| 2nd row | Polystichum sp. |
| 3rd row | Mesontoplatys bolzi |
| 4th row | Bursa granularis |
| 5th row | Amanses scopas |
| Value | Count | Frequency (%) |
| sp | 50633 | 7.9% |
| plethodon | 4673 | 0.7% |
| orconectes | 4548 | 0.7% |
| indet | 4202 | 0.7% |
| procambarus | 3784 | 0.6% |
| unidentified | 3701 | 0.6% |
| bathymodiolus | 2598 | 0.4% |
| cinereus | 2325 | 0.4% |
| riftia | 2006 | 0.3% |
| truncatus | 1926 | 0.3% |
| Other values (42984) | 556915 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 627231 | 10.8% |
| i | 493887 | 8.5% |
| s | 465451 | 8.0% |
| e | 411782 | 7.1% |
| o | 369786 | 6.3% |
| r | 352399 | 6.0% |
| (space) | 323256 | 5.5% |
| l | 300021 | 5.1% |
| n | 294984 | 5.1% |
| t | 291085 | 5.0% |
| Other values (69) | 1903548 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5130026 | |
| Space Separator | 323256 | 5.5% |
| Uppercase Letter | 316733 | 5.4% |
| Other Punctuation | 57342 | 1.0% |
| Open Punctuation | 2361 | < 0.1% |
| Close Punctuation | 2361 | < 0.1% |
| Decimal Number | 880 | < 0.1% |
| Connector Punctuation | 279 | < 0.1% |
| Dash Punctuation | 192 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 627231 | |
| i | 493887 | 9.6% |
| s | 465451 | 9.1% |
| e | 411782 | 8.0% |
| o | 369786 | 7.2% |
| r | 352399 | 6.9% |
| l | 300021 | 5.8% |
| n | 294984 | 5.8% |
| t | 291085 | 5.7% |
| u | 273206 | 5.3% |
| Other values (19) | 1250194 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 48084 | |
| C | 36649 | |
| A | 33460 | |
| S | 23813 | 7.5% |
| M | 18809 | 5.9% |
| E | 17848 | 5.6% |
| L | 16380 | 5.2% |
| H | 16021 | 5.1% |
| T | 14035 | 4.4% |
| D | 13545 | 4.3% |
| Other values (17) | 78089 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 270 | |
| 1 | 264 | |
| 2 | 99 | 11.2% |
| 3 | 65 | 7.4% |
| 6 | 56 | 6.4% |
| 7 | 46 | 5.2% |
| 8 | 26 | 3.0% |
| 4 | 23 | 2.6% |
| 5 | 17 | 1.9% |
| 9 | 14 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 56693 | |
| " | 304 | 0.5% |
| ' | 250 | 0.4% |
| , | 65 | 0.1% |
| / | 13 | < 0.1% |
| & | 11 | < 0.1% |
| ? | 5 | < 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 323256 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2361 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2361 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 279 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 192 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5446759 | |
| Common | 386671 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 627231 | |
| i | 493887 | 9.1% |
| s | 465451 | 8.5% |
| e | 411782 | 7.6% |
| o | 369786 | 6.8% |
| r | 352399 | 6.5% |
| l | 300021 | 5.5% |
| n | 294984 | 5.4% |
| t | 291085 | 5.3% |
| u | 273206 | 5.0% |
| Other values (46) | 1566927 |
Common
| Value | Count | Frequency (%) |
| (space) | 323256 | |
| . | 56693 | 14.7% |
| ( | 2361 | 0.6% |
| ) | 2361 | 0.6% |
| " | 304 | 0.1% |
| _ | 279 | 0.1% |
| 0 | 270 | 0.1% |
| 1 | 264 | 0.1% |
| ' | 250 | 0.1% |
| - | 192 | < 0.1% |
| Other values (13) | 441 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5833417 | |
| None | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 627231 | 10.8% |
| i | 493887 | 8.5% |
| s | 465451 | 8.0% |
| e | 411782 | 7.1% |
| o | 369786 | 6.3% |
| r | 352399 | 6.0% |
| (space) | 323256 | 5.5% |
| l | 300021 | 5.1% |
| n | 294984 | 5.1% |
| t | 291085 | 5.0% |
| Other values (65) | 1903535 |
None
| Value | Count | Frequency (%) |
| ë | 9 | |
| ö | 2 | 15.4% |
| Á | 1 | 7.7% |
| é | 1 | 7.7% |
typifiedName
Text
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 338060 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 9.352941176 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Japonica |
|---|---|
| 2nd row | Pallidata |
| 3rd row | Lepidophaga |
| 4th row | Dives |
| 5th row | Furcifer |
| Value | Count | Frequency (%) |
| lepidophaga | 4 | |
| dives | 4 | |
| tartarella | 4 | |
| inexpectata | 4 | |
| japonica | 2 | 5.9% |
| pallidata | 2 | 5.9% |
| furcifer | 2 | 5.9% |
| pervada | 2 | 5.9% |
| echinopanicis | 2 | 5.9% |
| ruptifascia | 2 | 5.9% |
| Other values (3) | 6 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 56 | |
| i | 28 | 8.8% |
| e | 28 | 8.8% |
| t | 20 | 6.3% |
| r | 20 | 6.3% |
| p | 18 | 5.7% |
| l | 18 | 5.7% |
| o | 16 | 5.0% |
| n | 14 | 4.4% |
| c | 14 | 4.4% |
| Other values (19) | 86 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 284 | |
| Uppercase Letter | 34 | 10.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 56 | |
| i | 28 | |
| e | 28 | |
| t | 20 | 7.0% |
| r | 20 | 7.0% |
| p | 18 | 6.3% |
| l | 18 | 6.3% |
| o | 16 | 5.6% |
| n | 14 | 4.9% |
| c | 14 | 4.9% |
| Other values (8) | 52 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 6 | |
| P | 4 | |
| L | 4 | |
| I | 4 | |
| D | 4 | |
| J | 2 | 5.9% |
| F | 2 | 5.9% |
| E | 2 | 5.9% |
| R | 2 | 5.9% |
| C | 2 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 318 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 56 | |
| i | 28 | 8.8% |
| e | 28 | 8.8% |
| t | 20 | 6.3% |
| r | 20 | 6.3% |
| p | 18 | 5.7% |
| l | 18 | 5.7% |
| o | 16 | 5.0% |
| n | 14 | 4.4% |
| c | 14 | 4.4% |
| Other values (19) | 86 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 318 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 56 | |
| i | 28 | 8.8% |
| e | 28 | 8.8% |
| t | 20 | 6.3% |
| r | 20 | 6.3% |
| p | 18 | 5.7% |
| l | 18 | 5.7% |
| o | 16 | 5.0% |
| n | 14 | 4.4% |
| c | 14 | 4.4% |
| Other values (19) | 86 |
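The typifiedName figures above can look odd at first glance: 13 distinct values is reported as 38.2%. The arithmetic below is a small sketch of how those numbers appear to fit together, assuming (this is an inference from the figures, not something the report states) that "Distinct (%)" and "Mean length" are both computed over non-missing rows only. All inputs are copied from this report; the variable names are illustrative.

```python
total_observations = 338094  # from the report overview
missing = 338060             # typifiedName "Missing"
distinct = 13                # typifiedName "Distinct"
total_characters = 318       # typifiedName character total (ASCII block count)

# Rows that actually carry a typifiedName value.
non_missing = total_observations - missing

# Assumption: both statistics use the non-missing rows as the denominator.
distinct_pct = round(100 * distinct / non_missing, 1)
mean_length = round(total_characters / non_missing, 9)

print(non_missing, distinct_pct, mean_length)  # 34 38.2 9.352941176
```

With only 34 populated rows, both the 38.2% distinct share and the mean length of 9.352941176 match the summary tables, which supports the non-missing denominator assumption for this column.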
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 338089 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 338089 | |
| M | 338089 | |
| L | 338089 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1014267 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 338089 | |
| M | 338089 | |
| L | 338089 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1014267 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 338089 | |
| M | 338089 | |
| L | 338089 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1014267 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 338089 | |
| M | 338089 | |
| L | 338089 |
lastParsed
Text
| Distinct | 31887 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99607204 |
| Min length | 20 |
Unique
| Unique | 3500 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 2024-12-01T12:07:01.240Z |
|---|---|
| 2nd row | 2024-12-01T12:07:01.438Z |
| 3rd row | 2024-12-01T12:07:01.443Z |
| 4th row | 2024-12-01T12:07:01.449Z |
| 5th row | 2024-12-01T12:07:01.465Z |
| Value | Count | Frequency (%) |
| 2024-12-01t12:07:38.532z | 73 | < 0.1% |
| 2024-12-01t12:07:38.533z | 71 | < 0.1% |
| 2024-12-01t12:07:38.508z | 68 | < 0.1% |
| 2024-12-01t12:07:39.879z | 67 | < 0.1% |
| 2024-12-01t12:07:39.819z | 65 | < 0.1% |
| 2024-12-01t12:07:37.936z | 65 | < 0.1% |
| 2024-12-01t12:07:40.339z | 65 | < 0.1% |
| 2024-12-01t12:07:39.875z | 64 | < 0.1% |
| 2024-12-01t12:07:39.723z | 64 | < 0.1% |
| 2024-12-01t12:07:38.854z | 63 | < 0.1% |
| Other values (31877) | 337424 | 99.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1562001 | 19.3% |
| 1 | 1172147 | 14.4% |
| 0 | 1166550 | 14.4% |
| - | 676178 | 8.3% |
| : | 676178 | 8.3% |
| 4 | 503489 | 6.2% |
| 7 | 475460 | 5.9% |
| T | 338089 | 4.2% |
| Z | 338089 | 4.2% |
| . | 337757 | 4.2% |
| Other values (5) | 866870 | 10.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5746517 | 70.8% |
| Other Punctuation | 1013935 | 12.5% |
| Dash Punctuation | 676178 | 8.3% |
| Uppercase Letter | 676178 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1562001 | 27.2% |
| 1 | 1172147 | 20.4% |
| 0 | 1166550 | 20.3% |
| 4 | 503489 | 8.8% |
| 7 | 475460 | 8.3% |
| 3 | 314692 | 5.5% |
| 8 | 145833 | 2.5% |
| 9 | 145808 | 2.5% |
| 6 | 137824 | 2.4% |
| 5 | 122713 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 676178 | 66.7% |
| . | 337757 | 33.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 338089 | 50.0% |
| Z | 338089 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 676178 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7436630 | 91.7% |
| Latin | 676178 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1562001 | 21.0% |
| 1 | 1172147 | 15.8% |
| 0 | 1166550 | 15.7% |
| - | 676178 | 9.1% |
| : | 676178 | 9.1% |
| 4 | 503489 | 6.8% |
| 7 | 475460 | 6.4% |
| . | 337757 | 4.5% |
| 3 | 314692 | 4.2% |
| 8 | 145833 | 2.0% |
| Other values (3) | 406345 | 5.5% |
Latin
| Value | Count | Frequency (%) |
| T | 338089 | 50.0% |
| Z | 338089 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8112808 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1562001 | 19.3% |
| 1 | 1172147 | 14.4% |
| 0 | 1166550 | 14.4% |
| - | 676178 | 8.3% |
| : | 676178 | 8.3% |
| 4 | 503489 | 6.2% |
| 7 | 475460 | 5.9% |
| T | 338089 | 4.2% |
| Z | 338089 | 4.2% |
| . | 337757 | 4.2% |
| Other values (5) | 866870 | 10.7% |
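The lastParsed length statistics above (minimum 20, maximum 24) and the "." count (337757 against 338089 "T" and "Z" characters) suggest that a small number of timestamps omit the millisecond part. The following is a minimal sketch of parsing both shapes. The helper name `parse_last_parsed` and the 20-character example value are illustrative assumptions, not taken from the dataset.

```python
from datetime import datetime, timezone

# Hypothetical helper: accepts both timestamp shapes implied by the
# length statistics (24 characters with milliseconds, 20 without).
FORMATS = ("%Y-%m-%dT%H:%M:%S.%fZ", "%Y-%m-%dT%H:%M:%SZ")


def parse_last_parsed(value: str) -> datetime:
    """Parse a UTC timestamp with or without fractional seconds."""
    for fmt in FORMATS:
        try:
            # The trailing "Z" is matched literally, so attach UTC explicitly.
            return datetime.strptime(value, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            continue
    raise ValueError(f"unrecognised timestamp: {value!r}")


with_ms = parse_last_parsed("2024-12-01T12:07:01.240Z")  # sample row, 24 characters
without_ms = parse_last_parsed("2024-12-01T12:07:01Z")   # illustrative, 20 characters
```

Trying the more specific format first keeps the common case on the fast path; any other shape raises rather than being silently coerced.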
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024-12-01T11:07:21.711Z |
|---|---|
| 2nd row | 2024-12-01T11:07:21.711Z |
| 3rd row | 2024-12-01T11:07:21.711Z |
| 4th row | 2024-12-01T11:07:21.711Z |
| 5th row | 2024-12-01T11:07:21.711Z |
| Value | Count | Frequency (%) |
| 2024-12-01t11:07:21.711z | 338089 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2366623 | 29.2% |
| 2 | 1352356 | 16.7% |
| 0 | 1014267 | 12.5% |
| - | 676178 | 8.3% |
| : | 676178 | 8.3% |
| 7 | 676178 | 8.3% |
| 4 | 338089 | 4.2% |
| T | 338089 | 4.2% |
| . | 338089 | 4.2% |
| Z | 338089 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5747513 | 70.8% |
| Other Punctuation | 1014267 | 12.5% |
| Dash Punctuation | 676178 | 8.3% |
| Uppercase Letter | 676178 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2366623 | 41.2% |
| 2 | 1352356 | 23.5% |
| 0 | 1014267 | 17.6% |
| 7 | 676178 | 11.8% |
| 4 | 338089 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 676178 | 66.7% |
| . | 338089 | 33.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 338089 | 50.0% |
| Z | 338089 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 676178 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7437958 | 91.7% |
| Latin | 676178 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2366623 | 31.8% |
| 2 | 1352356 | 18.2% |
| 0 | 1014267 | 13.6% |
| - | 676178 | 9.1% |
| : | 676178 | 9.1% |
| 7 | 676178 | 9.1% |
| 4 | 338089 | 4.5% |
| . | 338089 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 338089 | 50.0% |
| Z | 338089 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8114136 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2366623 | 29.2% |
| 2 | 1352356 | 16.7% |
| 0 | 1014267 | 12.5% |
| - | 676178 | 8.3% |
| : | 676178 | 8.3% |
| 7 | 676178 | 8.3% |
| 4 | 338089 | 4.2% |
| T | 338089 | 4.2% |
| . | 338089 | 4.2% |
| Z | 338089 | 4.2% |
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10837 |
| Missing (%) | 3.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.461276611 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 176301 | 53.9% |
| false | 150956 | 46.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 327257 | 22.4% |
| t | 176301 | 12.1% |
| r | 176301 | 12.1% |
| u | 176301 | 12.1% |
| f | 150956 | 10.3% |
| a | 150956 | 10.3% |
| l | 150956 | 10.3% |
| s | 150956 | 10.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1459984 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 327257 | 22.4% |
| t | 176301 | 12.1% |
| r | 176301 | 12.1% |
| u | 176301 | 12.1% |
| f | 150956 | 10.3% |
| a | 150956 | 10.3% |
| l | 150956 | 10.3% |
| s | 150956 | 10.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1459984 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 327257 | 22.4% |
| t | 176301 | 12.1% |
| r | 176301 | 12.1% |
| u | 176301 | 12.1% |
| f | 150956 | 10.3% |
| a | 150956 | 10.3% |
| l | 150956 | 10.3% |
| s | 150956 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1459984 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 327257 | 22.4% |
| t | 176301 | 12.1% |
| r | 176301 | 12.1% |
| u | 176301 | 12.1% |
| f | 150956 | 10.3% |
| a | 150956 | 10.3% |
| l | 150956 | 10.3% |
| s | 150956 | 10.3% |
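The repatriated character counts above follow directly from the two value counts: each character of "true" occurs once per `true` row, each character of "false" once per `false` row, and "e" occurs in both. A minimal sketch of that arithmetic, using only the counts reported in the value table (the variable names are illustrative):

```python
from collections import Counter

# Value counts taken from the repatriated value table.
value_counts = {"true": 176301, "false": 150956}

# Each occurrence of a value contributes one count to each of its characters.
char_counts = Counter()
for value, n in value_counts.items():
    for ch in value:
        char_counts[ch] += n

total_chars = sum(char_counts.values())
# Share of "e" among all characters, as a percentage rounded to one decimal.
freq_e = round(100 * char_counts["e"] / total_chars, 1)
```

The same derivation reproduces the category, script and block totals, since every character here is a lowercase ASCII Latin letter.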
isSequenced
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.789076249 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 266778 | 78.9% |
| true | 71311 | 21.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 338089 | 20.9% |
| f | 266778 | 16.5% |
| a | 266778 | 16.5% |
| l | 266778 | 16.5% |
| s | 266778 | 16.5% |
| t | 71311 | 4.4% |
| r | 71311 | 4.4% |
| u | 71311 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1619134 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 338089 | 20.9% |
| f | 266778 | 16.5% |
| a | 266778 | 16.5% |
| l | 266778 | 16.5% |
| s | 266778 | 16.5% |
| t | 71311 | 4.4% |
| r | 71311 | 4.4% |
| u | 71311 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1619134 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 338089 | 20.9% |
| f | 266778 | 16.5% |
| a | 266778 | 16.5% |
| l | 266778 | 16.5% |
| s | 266778 | 16.5% |
| t | 71311 | 4.4% |
| r | 71311 | 4.4% |
| u | 71311 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1619134 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 338089 | 20.9% |
| f | 266778 | 16.5% |
| a | 266778 | 16.5% |
| l | 266778 | 16.5% |
| s | 266778 | 16.5% |
| t | 71311 | 4.4% |
| r | 71311 | 4.4% |
| u | 71311 | 4.4% |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12220 |
| Missing (%) | 3.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.89375035 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | LATIN_AMERICA |
| 3rd row | OCEANIA |
| 4th row | NORTH_AMERICA |
| 5th row | ASIA |
| Value | Count | Frequency (%) |
| north_america | 153734 | 47.2% |
| latin_america | 78014 | 23.9% |
| oceania | 37070 | 11.4% |
| asia | 32944 | 10.1% |
| africa | 17920 | 5.5% |
| europe | 5860 | 1.8% |
| antarctica | 332 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 718374 | 20.2% |
| R | 409594 | 11.5% |
| I | 398028 | 11.2% |
| C | 287402 | 8.1% |
| E | 280538 | 7.9% |
| N | 269150 | 7.6% |
| T | 232412 | 6.5% |
| _ | 231748 | 6.5% |
| M | 231748 | 6.5% |
| O | 196664 | 5.5% |
| Other values (6) | 294332 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3318242 | 93.5% |
| Connector Punctuation | 231748 | 6.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 718374 | 21.6% |
| R | 409594 | 12.3% |
| I | 398028 | 12.0% |
| C | 287402 | 8.7% |
| E | 280538 | 8.5% |
| N | 269150 | 8.1% |
| T | 232412 | 7.0% |
| M | 231748 | 7.0% |
| O | 196664 | 5.9% |
| H | 153734 | 4.6% |
| Other values (5) | 140598 | 4.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 231748 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3318242 | 93.5% |
| Common | 231748 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 718374 | 21.6% |
| R | 409594 | 12.3% |
| I | 398028 | 12.0% |
| C | 287402 | 8.7% |
| E | 280538 | 8.5% |
| N | 269150 | 8.1% |
| T | 232412 | 7.0% |
| M | 231748 | 7.0% |
| O | 196664 | 5.9% |
| H | 153734 | 4.6% |
| Other values (5) | 140598 | 4.2% |
Common
| Value | Count | Frequency (%) |
| _ | 231748 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3549990 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 718374 | 20.2% |
| R | 409594 | 11.5% |
| I | 398028 | 11.2% |
| C | 287402 | 8.1% |
| E | 280538 | 7.9% |
| N | 269150 | 7.6% |
| T | 232412 | 6.5% |
| _ | 231748 | 6.5% |
| M | 231748 | 6.5% |
| O | 196664 | 5.5% |
| Other values (6) | 294332 | 8.3% |
publishedByGbifRegion
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 338089 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 676178 | 15.4% |
| A | 676178 | 15.4% |
| N | 338089 | 7.7% |
| O | 338089 | 7.7% |
| T | 338089 | 7.7% |
| H | 338089 | 7.7% |
| _ | 338089 | 7.7% |
| M | 338089 | 7.7% |
| E | 338089 | 7.7% |
| I | 338089 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4057068 | 92.3% |
| Connector Punctuation | 338089 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 676178 | 16.7% |
| A | 676178 | 16.7% |
| N | 338089 | 8.3% |
| O | 338089 | 8.3% |
| T | 338089 | 8.3% |
| H | 338089 | 8.3% |
| M | 338089 | 8.3% |
| E | 338089 | 8.3% |
| I | 338089 | 8.3% |
| C | 338089 | 8.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 338089 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4057068 | 92.3% |
| Common | 338089 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 676178 | 16.7% |
| A | 676178 | 16.7% |
| N | 338089 | 8.3% |
| O | 338089 | 8.3% |
| T | 338089 | 8.3% |
| H | 338089 | 8.3% |
| M | 338089 | 8.3% |
| E | 338089 | 8.3% |
| I | 338089 | 8.3% |
| C | 338089 | 8.3% |
Common
| Value | Count | Frequency (%) |
| _ | 338089 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4395157 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 676178 | 15.4% |
| A | 676178 | 15.4% |
| N | 338089 | 7.7% |
| O | 338089 | 7.7% |
| T | 338089 | 7.7% |
| H | 338089 | 7.7% |
| _ | 338089 | 7.7% |
| M | 338089 | 7.7% |
| E | 338089 | 7.7% |
| I | 338089 | 7.7% |
level0Gid
Text
Missing 
| Distinct | 174 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 157741 |
| Missing (%) | 46.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | CHN |
| 3rd row | KEN |
| 4th row | DJI |
| 5th row | USA |
| Value | Count | Frequency (%) |
| usa | 97215 | 53.9% |
| mmr | 7455 | 4.1% |
| mex | 4807 | 2.7% |
| guy | 4333 | 2.4% |
| phl | 4062 | 2.3% |
| pyf | 3614 | 2.0% |
| chn | 3325 | 1.8% |
| bra | 2654 | 1.5% |
| sur | 2611 | 1.4% |
| mdg | 2411 | 1.3% |
| Other values (164) | 47866 | 26.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 113189 | |
| U | 111068 | |
| S | 106325 | |
| M | 27389 | 5.1% |
| R | 20032 | 3.7% |
| P | 16725 | 3.1% |
| G | 16411 | 3.0% |
| N | 16119 | 3.0% |
| C | 13892 | 2.6% |
| E | 11267 | 2.1% |
| Other values (18) | 88642 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 541053 | |
| Decimal Number | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 113189 | |
| U | 111068 | |
| S | 106325 | |
| M | 27389 | 5.1% |
| R | 20032 | 3.7% |
| P | 16725 | 3.1% |
| G | 16411 | 3.0% |
| N | 16119 | 3.0% |
| C | 13892 | 2.6% |
| E | 11267 | 2.1% |
| Other values (16) | 88636 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 7 | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 541053 | |
| Common | 6 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 113189 | |
| U | 111068 | |
| S | 106325 | |
| M | 27389 | 5.1% |
| R | 20032 | 3.7% |
| P | 16725 | 3.1% |
| G | 16411 | 3.0% |
| N | 16119 | 3.0% |
| C | 13892 | 2.6% |
| E | 11267 | 2.1% |
| Other values (16) | 88636 |
Common
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 7 | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 541059 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 113189 | |
| U | 111068 | |
| S | 106325 | |
| M | 27389 | 5.1% |
| R | 20032 | 3.7% |
| P | 16725 | 3.1% |
| G | 16411 | 3.0% |
| N | 16119 | 3.0% |
| C | 13892 | 2.6% |
| E | 11267 | 2.1% |
| Other values (18) | 88642 |
level0Name
Text
Missing 
| Distinct | 174 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 157741 |
| Missing (%) | 46.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 13 |
| Mean length | 11.06804988 |
| Min length | 4 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | China |
| 3rd row | Kenya |
| 4th row | Djibouti |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 97300 | 32.1% |
| states | 97245 | 32.1% |
| myanmar | 7455 | 2.5% |
| méxico | 4807 | 1.6% |
| guyana | 4333 | 1.4% |
| philippines | 4062 | 1.3% |
| french | 3933 | 1.3% |
| polynesia | 3614 | 1.2% |
| china | 3325 | 1.1% |
| new | 2794 | 0.9% |
| Other values (202) | 73833 | 24.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 308073 | |
| e | 235889 | |
| a | 215937 | |
| i | 164095 | |
| n | 156657 | |
| (space) | 122348 | 6.1% |
| s | 118233 | 5.9% |
| d | 109469 | 5.5% |
| S | 105440 | 5.3% |
| U | 97741 | 4.9% |
| Other values (51) | 362274 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1573656 | |
| Uppercase Letter | 299357 | 15.0% |
| Space Separator | 122348 | 6.1% |
| Dash Punctuation | 528 | < 0.1% |
| Other Punctuation | 261 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 308073 | |
| e | 235889 | |
| a | 215937 | |
| i | 164095 | |
| n | 156657 | |
| s | 118233 | 7.5% |
| d | 109469 | 7.0% |
| r | 36548 | 2.3% |
| o | 33176 | 2.1% |
| u | 30222 | 1.9% |
| Other values (21) | 165357 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 105440 | |
| U | 97741 | |
| M | 16188 | 5.4% |
| P | 15551 | 5.2% |
| C | 12459 | 4.2% |
| G | 9979 | 3.3% |
| B | 5964 | 2.0% |
| A | 5868 | 2.0% |
| R | 4788 | 1.6% |
| F | 4584 | 1.5% |
| Other values (13) | 20795 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 133 | |
| . | 78 | |
| ' | 50 | 19.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 122348 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 528 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1873013 | |
| Common | 123143 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 308073 | |
| e | 235889 | |
| a | 215937 | |
| i | 164095 | |
| n | 156657 | |
| s | 118233 | 6.3% |
| d | 109469 | 5.8% |
| S | 105440 | 5.6% |
| U | 97741 | 5.2% |
| r | 36548 | 2.0% |
| Other values (44) | 324931 |
Common
| Value | Count | Frequency (%) |
| (space) | 122348 | 99.4% |
| - | 528 | 0.4% |
| , | 133 | 0.1% |
| . | 78 | 0.1% |
| ' | 50 | < 0.1% |
| ( | 3 | < 0.1% |
| ) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1987453 | |
| None | 8703 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 308073 | |
| e | 235889 | |
| a | 215937 | |
| i | 164095 | |
| n | 156657 | |
| (space) | 122348 | 6.2% |
| s | 118233 | 5.9% |
| d | 109469 | 5.5% |
| S | 105440 | 5.3% |
| U | 97741 | 4.9% |
| Other values (46) | 353571 |
None
| Value | Count | Frequency (%) |
| é | 5680 | |
| ç | 1227 | 14.1% |
| ã | 873 | 10.0% |
| í | 873 | 10.0% |
| ô | 50 | 0.6% |
level1Gid
Text
Missing 
| Distinct | 1235 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 158980 |
| Missing (%) | 47.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.591483636 |
| Min length | 6 |
Unique
| Unique | 39 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USA.3_1 |
|---|---|
| 2nd row | CHN.29_1 |
| 3rd row | KEN.20_1 |
| 4th row | DJI.3_1 |
| 5th row | USA.2_1 |
| Value | Count | Frequency (%) |
| usa.5_1 | 10272 | 5.7% |
| usa.44_1 | 9642 | 5.4% |
| usa.3_1 | 9251 | 5.2% |
| usa.10_1 | 8853 | 4.9% |
| usa.47_1 | 6219 | 3.5% |
| mmr.14_1 | 5452 | 3.0% |
| usa.21_1 | 4649 | 2.6% |
| usa.34_1 | 4328 | 2.4% |
| usa.43_1 | 3079 | 1.7% |
| usa.32_1 | 3055 | 1.7% |
| Other values (1225) | 114314 | 63.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 235844 | |
| _ | 179114 | |
| . | 179079 | |
| A | 113189 | |
| U | 109841 | |
| S | 106325 | 7.8% |
| 4 | 56352 | 4.1% |
| 2 | 42387 | 3.1% |
| 3 | 40233 | 3.0% |
| M | 27389 | 2.0% |
| Other values (28) | 269988 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 537336 | |
| Decimal Number | 464212 | |
| Connector Punctuation | 179114 | 13.2% |
| Other Punctuation | 179079 | 13.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 113189 | |
| U | 109841 | |
| S | 106325 | |
| M | 27389 | 5.1% |
| R | 20032 | 3.7% |
| P | 16722 | 3.1% |
| G | 16411 | 3.1% |
| N | 16110 | 3.0% |
| C | 12662 | 2.4% |
| E | 11267 | 2.1% |
| Other values (16) | 87388 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 235844 | |
| 4 | 56352 | 12.1% |
| 2 | 42387 | 9.1% |
| 3 | 40233 | 8.7% |
| 5 | 20947 | 4.5% |
| 0 | 16021 | 3.5% |
| 9 | 15545 | 3.3% |
| 7 | 13298 | 2.9% |
| 6 | 12329 | 2.7% |
| 8 | 11256 | 2.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 179114 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 179079 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 822405 | |
| Latin | 537336 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 113189 | |
| U | 109841 | |
| S | 106325 | |
| M | 27389 | 5.1% |
| R | 20032 | 3.7% |
| P | 16722 | 3.1% |
| G | 16411 | 3.1% |
| N | 16110 | 3.0% |
| C | 12662 | 2.4% |
| E | 11267 | 2.1% |
| Other values (16) | 87388 |
Common
| Value | Count | Frequency (%) |
| 1 | 235844 | |
| _ | 179114 | |
| . | 179079 | |
| 4 | 56352 | 6.9% |
| 2 | 42387 | 5.2% |
| 3 | 40233 | 4.9% |
| 5 | 20947 | 2.5% |
| 0 | 16021 | 1.9% |
| 9 | 15545 | 1.9% |
| 7 | 13298 | 1.6% |
| Other values (2) | 23585 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1359741 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 235844 | |
| _ | 179114 | |
| . | 179079 | |
| A | 113189 | |
| U | 109841 | |
| S | 106325 | 7.8% |
| 4 | 56352 | 4.1% |
| 2 | 42387 | 3.1% |
| 3 | 40233 | 3.0% |
| M | 27389 | 2.0% |
| Other values (28) | 269988 |
level1Name
Text
Missing 
| Distinct | 1201 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 158980 |
| Missing (%) | 47.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 9.053652981 |
| Min length | 3 |
Unique
| Unique | 36 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arizona |
|---|---|
| 2nd row | Xizang |
| 3rd row | Laikipia |
| 4th row | Djiboutii |
| 5th row | Alaska |
| Value | Count | Frequency (%) |
| california | 10397 | 4.5% |
| texas | 9642 | 4.2% |
| arizona | 9251 | 4.0% |
| florida | 8853 | 3.9% |
| virginia | 8047 | 3.5% |
| new | 6247 | 2.7% |
| carolina | 6031 | 2.6% |
| tanintharyi | 5452 | 2.4% |
| maryland | 4649 | 2.0% |
| north | 4454 | 1.9% |
| Other values (1333) | 155634 | 68.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 229700 | |
| i | 163412 | 10.1% |
| n | 127711 | 7.9% |
| r | 114166 | 7.0% |
| o | 112430 | 6.9% |
| e | 91904 | 5.7% |
| s | 74858 | 4.6% |
| l | 66916 | 4.1% |
| t | 53967 | 3.3% |
| (space) | 49543 | 3.1% |
| Other values (91) | 537029 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1337352 | |
| Uppercase Letter | 227538 | 14.0% |
| Space Separator | 49543 | 3.1% |
| Dash Punctuation | 6638 | 0.4% |
| Other Punctuation | 559 | < 0.1% |
| Modifier Symbol | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 229700 | |
| i | 163412 | |
| n | 127711 | |
| r | 114166 | |
| o | 112430 | |
| e | 91904 | 6.9% |
| s | 74858 | 5.6% |
| l | 66916 | 5.0% |
| t | 53967 | 4.0% |
| u | 48010 | 3.6% |
| Other values (54) | 254278 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 33751 | |
| T | 24347 | |
| M | 23452 | |
| A | 19365 | 8.5% |
| S | 17430 | 7.7% |
| N | 16131 | 7.1% |
| V | 12256 | 5.4% |
| F | 10777 | 4.7% |
| P | 8121 | 3.6% |
| O | 5900 | 2.6% |
| Other values (19) | 56008 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 358 | |
| / | 136 | 24.3% |
| , | 55 | 9.8% |
| . | 6 | 1.1% |
| ! | 4 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 49543 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6638 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1564890 | |
| Common | 56746 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 229700 | |
| i | 163412 | 10.4% |
| n | 127711 | 8.2% |
| r | 114166 | 7.3% |
| o | 112430 | 7.2% |
| e | 91904 | 5.9% |
| s | 74858 | 4.8% |
| l | 66916 | 4.3% |
| t | 53967 | 3.4% |
| u | 48010 | 3.1% |
| Other values (83) | 481816 |
Common
| Value | Count | Frequency (%) |
| (space) | 49543 | 87.3% |
| - | 6638 | 11.7% |
| ' | 358 | 0.6% |
| / | 136 | 0.2% |
| , | 55 | 0.1% |
| . | 6 | < 0.1% |
| ` | 6 | < 0.1% |
| ! | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1608030 | |
| None | 13479 | 0.8% |
| Latin Ext Additional | 127 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 229700 | |
| i | 163412 | 10.2% |
| n | 127711 | 7.9% |
| r | 114166 | 7.1% |
| o | 112430 | 7.0% |
| e | 91904 | 5.7% |
| s | 74858 | 4.7% |
| l | 66916 | 4.2% |
| t | 53967 | 3.4% |
| (space) | 49543 | 3.1% |
| Other values (50) | 523423 |
None
| Value | Count | Frequency (%) |
| Î | 3639 | |
| é | 2945 | |
| í | 2133 | |
| á | 1541 | |
| ã | 1017 | 7.5% |
| ó | 574 | 4.3% |
| ô | 236 | 1.8% |
| ñ | 215 | 1.6% |
| ü | 210 | 1.6% |
| š | 207 | 1.5% |
| Other values (26) | 762 | 5.7% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 82 | |
| ằ | 40 | |
| ị | 3 | 2.4% |
| ọ | 1 | 0.8% |
| ồ | 1 | 0.8% |
level2Gid
Text
Missing 
| Distinct | 4284 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 167237 |
| Missing (%) | 49.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.21143412 |
| Min length | 8 |
Unique
| Unique | 290 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | USA.3.2_1 |
|---|---|
| 2nd row | CHN.29.7_1 |
| 3rd row | KEN.20.2_1 |
| 4th row | DJI.3.1_2 |
| 5th row | USA.2.9_1 |
| Value | Count | Frequency (%) |
| mmr.14.2_1 | 3998 | 2.3% |
| usa.3.2_1 | 3339 | 2.0% |
| usa.3.11_1 | 2680 | 1.6% |
| usa.9.1_1 | 2296 | 1.3% |
| guy.2.8_1 | 2193 | 1.3% |
| usa.5.37_1 | 1794 | 1.1% |
| usa.26.95_1 | 1453 | 0.9% |
| usa.32.26_1 | 1409 | 0.8% |
| usa.44.22_1 | 1372 | 0.8% |
| mmr.14.3_1 | 1320 | 0.8% |
| Other values (4274) | 149003 | 87.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 341679 | |
| 1 | 278548 | |
| _ | 170857 | |
| A | 112729 | 6.5% |
| U | 109431 | 6.3% |
| S | 105468 | 6.0% |
| 2 | 95569 | 5.5% |
| 4 | 80788 | 4.6% |
| 3 | 72057 | 4.1% |
| 5 | 47041 | 2.7% |
| Other values (28) | 330528 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 719594 | |
| Uppercase Letter | 512565 | |
| Other Punctuation | 341679 | |
| Connector Punctuation | 170857 | 9.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 112729 | |
| U | 109431 | |
| S | 105468 | |
| M | 26536 | 5.2% |
| R | 19482 | 3.8% |
| N | 16096 | 3.1% |
| G | 15657 | 3.1% |
| C | 12491 | 2.4% |
| P | 11988 | 2.3% |
| E | 11168 | 2.2% |
| Other values (16) | 71519 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 278548 | |
| 2 | 95569 | 13.3% |
| 4 | 80788 | 11.2% |
| 3 | 72057 | 10.0% |
| 5 | 47041 | 6.5% |
| 6 | 32059 | 4.5% |
| 0 | 29267 | 4.1% |
| 7 | 29047 | 4.0% |
| 9 | 28157 | 3.9% |
| 8 | 27061 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 341679 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 170857 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1232130 | |
| Latin | 512565 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 112729 | |
| U | 109431 | |
| S | 105468 | |
| M | 26536 | 5.2% |
| R | 19482 | 3.8% |
| N | 16096 | 3.1% |
| G | 15657 | 3.1% |
| C | 12491 | 2.4% |
| P | 11988 | 2.3% |
| E | 11168 | 2.2% |
| Other values (16) | 71519 |
Common
| Value | Count | Frequency (%) |
| . | 341679 | |
| 1 | 278548 | |
| _ | 170857 | |
| 2 | 95569 | 7.8% |
| 4 | 80788 | 6.6% |
| 3 | 72057 | 5.8% |
| 5 | 47041 | 3.8% |
| 6 | 32059 | 2.6% |
| 0 | 29267 | 2.4% |
| 7 | 29047 | 2.4% |
| Other values (2) | 55218 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1744695 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 341679 | |
| 1 | 278548 | |
| _ | 170857 | |
| A | 112729 | 6.5% |
| U | 109431 | 6.3% |
| S | 105468 | 6.0% |
| 2 | 95569 | 5.5% |
| 4 | 80788 | 4.6% |
| 3 | 72057 | 4.1% |
| 5 | 47041 | 2.7% |
| Other values (28) | 330528 |
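The level1Gid and level2Gid samples above (for example `USA.3_1`, `USA.3.2_1`, `DJI.3.1_2`) share one shape: a three-character country code, one dotted index per administrative level, then an underscore and a trailing number. The sketch below splits such an identifier into those parts. Reading the trailing number as a version, and the names `Gid` and `parse_gid`, are assumptions made for illustration; the report itself does not define the format. Level-0 identifiers such as `USA` carry no suffix and are not handled here.

```python
import re
from typing import NamedTuple, Tuple


class Gid(NamedTuple):
    country: str            # three-character code, e.g. "USA"
    path: Tuple[int, ...]   # one index per administrative level
    version: int            # trailing number after "_" (assumed meaning)


# Country code, zero or more ".<digits>" groups, then "_<digits>".
GID_RE = re.compile(r"^([A-Z0-9]{3})((?:\.\d+)*)_(\d+)$")


def parse_gid(gid: str) -> Gid:
    """Split an identifier like 'USA.3.2_1' into its parts."""
    match = GID_RE.match(gid)
    if match is None:
        raise ValueError(f"unexpected identifier shape: {gid!r}")
    country, dotted, version = match.groups()
    # ".3.2" splits into ["", "3", "2"]; drop the leading empty piece.
    path = tuple(int(part) for part in dotted.split(".") if part)
    return Gid(country, path, int(version))


level2 = parse_gid("USA.3.2_1")
level1_prefix = level2.path[:1]  # index of the enclosing level-1 unit
```

Because the path is a tuple, the enclosing unit at any level is a prefix slice, which makes it straightforward to check that a level2Gid is consistent with the level1Gid on the same row.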
level2Name
Text
Missing 
| Distinct | 3706 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 167249 |
| Missing (%) | 49.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 8.680839357 |
| Min length | 2 |
Unique
| Unique | 227 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Cochise |
|---|---|
| 2nd row | Shigatse |
| 3rd row | Laikipia North |
| 4th row | Djiboutii |
| 5th row | Haines |
| Value | Count | Frequency (%) |
| of | 6496 | 2.8% |
| san | 5277 | 2.3% |
| kawthoung | 3998 | 1.7% |
| region | 3757 | 1.6% |
| rest | 3754 | 1.6% |
| cochise | 3339 | 1.5% |
| saint | 2993 | 1.3% |
| city | 2937 | 1.3% |
| pima | 2680 | 1.2% |
| columbia | 2348 | 1.0% |
| Other values (3993) | 191864 | 83.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 169779 | 11.4% |
| o | 113886 | 7.7% |
| e | 111771 | 7.5% |
| n | 109067 | 7.4% |
| i | 102293 | 6.9% |
| r | 76609 | 5.2% |
| t | 66878 | 4.5% |
| (space) | 58598 | 4.0% |
| s | 57490 | 3.9% |
| l | 54814 | 3.7% |
| Other values (121) | 561893 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1192084 | |
| Uppercase Letter | 220329 | 14.9% |
| Space Separator | 58598 | 4.0% |
| Dash Punctuation | 4331 | 0.3% |
| Decimal Number | 4160 | 0.3% |
| Other Punctuation | 2707 | 0.2% |
| Open Punctuation | 385 | < 0.1% |
| Close Punctuation | 297 | < 0.1% |
| Math Symbol | 187 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 169779 | |
| o | 113886 | |
| e | 111771 | |
| n | 109067 | 9.1% |
| i | 102293 | 8.6% |
| r | 76609 | 6.4% |
| t | 66878 | 5.6% |
| s | 57490 | 4.8% |
| l | 54814 | 4.6% |
| u | 50124 | 4.2% |
| Other values (62) | 279373 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 28869 | |
| S | 22774 | 10.3% |
| M | 15767 | 7.2% |
| B | 14864 | 6.7% |
| L | 14643 | 6.6% |
| P | 13514 | 6.1% |
| A | 12476 | 5.7% |
| D | 11771 | 5.3% |
| R | 11687 | 5.3% |
| K | 10475 | 4.8% |
| Other values (27) | 63489 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2220 | |
| 9 | 873 | 21.0% |
| 8 | 661 | 15.9% |
| 1 | 210 | 5.0% |
| 3 | 65 | 1.6% |
| 4 | 56 | 1.3% |
| 2 | 36 | 0.9% |
| 5 | 27 | 0.6% |
| 6 | 9 | 0.2% |
| 0 | 3 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1550 | |
| . | 714 | |
| / | 177 | 6.5% |
| , | 165 | 6.1% |
| & | 73 | 2.7% |
| ? | 25 | 0.9% |
| # | 3 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 58598 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4331 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 385 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 297 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 187 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1412413 | |
| Common | 70665 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 169779 | 12.0% |
| o | 113886 | 8.1% |
| e | 111771 | 7.9% |
| n | 109067 | 7.7% |
| i | 102293 | 7.2% |
| r | 76609 | 5.4% |
| t | 66878 | 4.7% |
| s | 57490 | 4.1% |
| l | 54814 | 3.9% |
| u | 50124 | 3.5% |
| Other values (99) | 499702 |
Common
| Value | Count | Frequency (%) |
| (space) | 58598 | 82.9% |
| - | 4331 | 6.1% |
| 7 | 2220 | 3.1% |
| ' | 1550 | 2.2% |
| 9 | 873 | 1.2% |
| . | 714 | 1.0% |
| 8 | 661 | 0.9% |
| ( | 385 | 0.5% |
| ) | 297 | 0.4% |
| 1 | 210 | 0.3% |
| Other values (12) | 826 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1469916 | |
| None | 12819 | 0.9% |
| Latin Ext Additional | 343 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 169779 | 11.6% |
| o | 113886 | 7.7% |
| e | 111771 | 7.6% |
| n | 109067 | 7.4% |
| i | 102293 | 7.0% |
| r | 76609 | 5.2% |
| t | 66878 | 4.5% |
| (space) | 58598 | 4.0% |
| s | 57490 | 3.9% |
| l | 54814 | 3.7% |
| Other values (64) | 548731 |
None
| Value | Count | Frequency (%) |
| é | 2769 | |
| í | 2697 | |
| á | 2045 | |
| ó | 1173 | |
| ê | 1152 | |
| ñ | 422 | 3.3% |
| ú | 349 | 2.7% |
| â | 275 | 2.1% |
| ô | 245 | 1.9% |
| ü | 237 | 1.8% |
| Other values (38) | 1455 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ắ | 171 | 49.9% |
| ồ | 81 | 23.6% |
| ừ | 31 | 9.0% |
| ạ | 31 | 9.0% |
| ả | 18 | 5.2% |
| ẫ | 6 | 1.7% |
| ờ | 3 | 0.9% |
| ử | 1 | 0.3% |
| ỷ | 1 | 0.3% |
level3Gid
Text
Missing 
| Distinct | 1717 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 300258 |
| Missing (%) | 88.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 11.93075378 |
| Min length | 11 |
Unique
| Unique | 124 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | CHN.29.7.10_1 |
|---|---|
| 2nd row | KEN.20.2.3_1 |
| 3rd row | PAN.12.6.1_1 |
| 4th row | PAN.3.3.1_1 |
| 5th row | CHN.29.5.1_1 |
| Value | Count | Frequency (%) |
| mmr.14.2.1_1 | 3995 | 10.6% |
| mdg.3.5.1_1 | 907 | 2.4% |
| pan.3.3.1_1 | 740 | 2.0% |
| mmr.14.3.3_1 | 720 | 1.9% |
| cri.4.10.3_1 | 708 | 1.9% |
| mmr.14.3.1_1 | 600 | 1.6% |
| chn.29.5.5_1 | 570 | 1.5% |
| ken.20.2.3_1 | 553 | 1.5% |
| mdg.6.2.3_1 | 539 | 1.4% |
| bol.8.14.1_2 | 531 | 1.4% |
| Other values (1707) | 27973 | 73.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 113508 | 25.1% |
| 1 | 78511 | 17.4% |
| _ | 37836 | 8.4% |
| 2 | 27108 | 6.0% |
| 3 | 18840 | 4.2% |
| M | 18153 | 4.0% |
| 4 | 17781 | 3.9% |
| R | 12466 | 2.8% |
| 5 | 11565 | 2.6% |
| C | 9801 | 2.2% |
| Other values (24) | 105843 | 23.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 186566 | 41.3% |
| Other Punctuation | 113508 | 25.1% |
| Uppercase Letter | 113502 | 25.1% |
| Connector Punctuation | 37836 | 8.4% |
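The category labels in this table correspond to Unicode general categories. The following is a minimal sketch, not the profiler's own code, of how such counts could be reproduced with Python's standard `unicodedata` module, applied to the five sample `level3Gid` values listed above. The helper name `category_counts` and the label mapping are illustrative assumptions.

```python
import unicodedata
from collections import Counter

# Readable labels for the Unicode general-category codes that appear in
# this report (assumed mapping, written out by hand for illustration).
CATEGORY_LABELS = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Sm": "Math Symbol",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


# The five sample rows shown for level3Gid.
sample = [
    "CHN.29.7.10_1",
    "KEN.20.2.3_1",
    "PAN.12.6.1_1",
    "PAN.3.3.1_1",
    "CHN.29.5.1_1",
]

counts = category_counts(sample)
total = sum(counts.values())
for label, n in counts.most_common():
    print(f"{label}: {n} ({100 * n / total:.1f}%)")
```

Each sample value contains three uppercase letters, three full stops, and one underscore, which matches the near 3:3:1 ratio of Uppercase Letter, Other Punctuation, and Connector Punctuation counts in the full column.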
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 18153 | 16.0% |
| R | 12466 | 11.0% |
| C | 9801 | 8.6% |
| H | 9366 | 8.3% |
| N | 9192 | 8.1% |
| P | 8180 | 7.2% |
| A | 7759 | 6.8% |
| L | 7678 | 6.8% |
| E | 4998 | 4.4% |
| D | 3287 | 2.9% |
| Other values (12) | 22622 | 19.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 78511 | 42.1% |
| 2 | 27108 | 14.5% |
| 3 | 18840 | 10.1% |
| 4 | 17781 | 9.5% |
| 5 | 11565 | 6.2% |
| 6 | 7715 | 4.1% |
| 9 | 7619 | 4.1% |
| 8 | 6256 | 3.4% |
| 7 | 5877 | 3.2% |
| 0 | 5294 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 113508 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 37836 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 337910 | 74.9% |
| Latin | 113502 | 25.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 18153 | 16.0% |
| R | 12466 | 11.0% |
| C | 9801 | 8.6% |
| H | 9366 | 8.3% |
| N | 9192 | 8.1% |
| P | 8180 | 7.2% |
| A | 7759 | 6.8% |
| L | 7678 | 6.8% |
| E | 4998 | 4.4% |
| D | 3287 | 2.9% |
| Other values (12) | 22622 | 19.9% |
Common
| Value | Count | Frequency (%) |
| . | 113508 | 33.6% |
| 1 | 78511 | 23.2% |
| _ | 37836 | 11.2% |
| 2 | 27108 | 8.0% |
| 3 | 18840 | 5.6% |
| 4 | 17781 | 5.3% |
| 5 | 11565 | 3.4% |
| 6 | 7715 | 2.3% |
| 9 | 7619 | 2.3% |
| 8 | 6256 | 1.9% |
| Other values (2) | 11171 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 451412 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 113508 | 25.1% |
| 1 | 78511 | 17.4% |
| _ | 37836 | 8.4% |
| 2 | 27108 | 6.0% |
| 3 | 18840 | 4.2% |
| M | 18153 | 4.0% |
| 4 | 17781 | 3.9% |
| R | 12466 | 2.8% |
| 5 | 11565 | 2.6% |
| C | 9801 | 2.2% |
| Other values (24) | 105843 | 23.4% |
level3Name
Text
Missing 
| Distinct | 1653 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 300562 |
| Missing (%) | 88.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 8.898300117 |
| Min length | 3 |
Unique
| Unique | 120 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Nyalam |
|---|---|
| 2nd row | Segera |
| 3rd row | Ancón |
| 4th row | El Harino |
| 5th row | Bomi |
| Value | Count | Frequency (%) |
| bokpyin | 3995 | 8.0% |
| el | 1132 | 2.3% |
| san | 907 | 1.8% |
| ifanadiana | 907 | 1.8% |
| las | 754 | 1.5% |
| harino | 740 | 1.5% |
| tenasserim | 720 | 1.4% |
| horquetas | 708 | 1.4% |
| poblacion | 702 | 1.4% |
| mergui | 600 | 1.2% |
| Other values (1879) | 39014 | 77.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 46937 | 14.1% |
| n | 29594 | 8.9% |
| o | 26066 | 7.8% |
| i | 24210 | 7.2% |
| r | 16414 | 4.9% |
| e | 15331 | 4.6% |
| (space) | 12647 | 3.8% |
| u | 10324 | 3.1% |
| g | 9540 | 2.9% |
| s | 9502 | 2.8% |
| Other values (101) | 133406 | 39.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 266145 | 79.7% |
| Uppercase Letter | 49075 | 14.7% |
| Space Separator | 12647 | 3.8% |
| Decimal Number | 2088 | 0.6% |
| Other Punctuation | 1832 | 0.5% |
| Open Punctuation | 785 | 0.2% |
| Close Punctuation | 782 | 0.2% |
| Dash Punctuation | 617 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 46937 | 17.6% |
| n | 29594 | 11.1% |
| o | 26066 | 9.8% |
| i | 24210 | 9.1% |
| r | 16414 | 6.2% |
| e | 15331 | 5.8% |
| u | 10324 | 3.9% |
| g | 9540 | 3.6% |
| s | 9502 | 3.6% |
| l | 9402 | 3.5% |
| Other values (52) | 68825 | 25.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 7021 | 14.3% |
| M | 5183 | 10.6% |
| S | 4536 | 9.2% |
| T | 2935 | 6.0% |
| P | 2922 | 6.0% |
| C | 2900 | 5.9% |
| A | 2557 | 5.2% |
| H | 2425 | 4.9% |
| L | 2402 | 4.9% |
| I | 2316 | 4.7% |
| Other values (20) | 13878 | 28.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 442 | 21.2% |
| 1 | 425 | 20.4% |
| 2 | 417 | 20.0% |
| 9 | 187 | 9.0% |
| 4 | 157 | 7.5% |
| 7 | 136 | 6.5% |
| 5 | 101 | 4.8% |
| 3 | 92 | 4.4% |
| 8 | 83 | 4.0% |
| 0 | 48 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1193 | 65.1% |
| , | 312 | 17.0% |
| ' | 231 | 12.6% |
| / | 74 | 4.0% |
| ! | 22 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12647 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 785 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 782 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 617 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 315220 | 94.4% |
| Common | 18751 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 46937 | 14.9% |
| n | 29594 | 9.4% |
| o | 26066 | 8.3% |
| i | 24210 | 7.7% |
| r | 16414 | 5.2% |
| e | 15331 | 4.9% |
| u | 10324 | 3.3% |
| g | 9540 | 3.0% |
| s | 9502 | 3.0% |
| l | 9402 | 3.0% |
| Other values (82) | 117900 | 37.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 12647 | 67.4% |
| . | 1193 | 6.4% |
| ( | 785 | 4.2% |
| ) | 782 | 4.2% |
| - | 617 | 3.3% |
| 6 | 442 | 2.4% |
| 1 | 425 | 2.3% |
| 2 | 417 | 2.2% |
| , | 312 | 1.7% |
| ' | 231 | 1.2% |
| Other values (9) | 900 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 331264 | 99.2% |
| None | 2516 | 0.8% |
| Latin Ext Additional | 191 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 46937 | 14.2% |
| n | 29594 | 8.9% |
| o | 26066 | 7.9% |
| i | 24210 | 7.3% |
| r | 16414 | 5.0% |
| e | 15331 | 4.6% |
| (space) | 12647 | 3.8% |
| u | 10324 | 3.1% |
| g | 9540 | 2.9% |
| s | 9502 | 2.9% |
| Other values (61) | 130699 | 39.5% |
None
| Value | Count | Frequency (%) |
| é | 537 | 21.3% |
| ê | 514 | 20.4% |
| ñ | 263 | 10.5% |
| ó | 250 | 9.9% |
| ơ | 194 | 7.7% |
| í | 128 | 5.1% |
| á | 111 | 4.4% |
| â | 87 | 3.5% |
| ü | 86 | 3.4% |
| à | 61 | 2.4% |
| Other values (22) | 285 | 11.3% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 83 | 43.5% |
| ạ | 30 | 15.7% |
| ồ | 27 | 14.1% |
| ọ | 26 | 13.6% |
| ờ | 10 | 5.2% |
| ắ | 9 | 4.7% |
| ậ | 3 | 1.6% |
| ộ | 3 | 1.6% |
iucnRedListCategory
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 63456 |
| Missing (%) | 18.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | NE |
| 3rd row | NE |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 174813 | 63.7% |
| lc | 87132 | 31.7% |
| vu | 3277 | 1.2% |
| nt | 3158 | 1.1% |
| dd | 2671 | 1.0% |
| en | 2568 | 0.9% |
| cr | 968 | 0.4% |
| ex | 34 | < 0.1% |
| ew | 17 | < 0.1% |
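Each frequency in this table is the value's count divided by the number of non-missing rows (338094 observations minus 63456 missing, i.e. 274638). The sketch below recomputes the percentages from the counts listed above. The function name `frequency_percent` is a hypothetical helper, and rounding to one decimal place is an assumption based on how the report displays its figures.

```python
# Counts per two-letter category code, copied from the table above
# (written in upper case, as in the sample rows).
value_counts = {
    "NE": 174813,
    "LC": 87132,
    "VU": 3277,
    "NT": 3158,
    "DD": 2671,
    "EN": 2568,
    "CR": 968,
    "EX": 34,
    "EW": 17,
}


def frequency_percent(counts):
    """Return each count as a percentage of the total, rounded to 0.1."""
    total = sum(counts.values())
    return {key: round(100 * n / total, 1) for key, n in counts.items()}


non_missing = sum(value_counts.values())
percentages = frequency_percent(value_counts)

print(f"non-missing rows: {non_missing}")
for code, pct in percentages.items():
    # Very small shares round to 0.0 here; the report prints them as "< 0.1%".
    print(f"{code}: {pct}%")
```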
Most occurring characters
| Value | Count | Frequency (%) |
| N | 180539 | 32.9% |
| E | 177432 | 32.3% |
| C | 88100 | 16.0% |
| L | 87132 | 15.9% |
| D | 5342 | 1.0% |
| V | 3277 | 0.6% |
| U | 3277 | 0.6% |
| T | 3158 | 0.6% |
| R | 968 | 0.2% |
| X | 34 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 549276 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 180539 | 32.9% |
| E | 177432 | 32.3% |
| C | 88100 | 16.0% |
| L | 87132 | 15.9% |
| D | 5342 | 1.0% |
| V | 3277 | 0.6% |
| U | 3277 | 0.6% |
| T | 3158 | 0.6% |
| R | 968 | 0.2% |
| X | 34 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 549276 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 180539 | 32.9% |
| E | 177432 | 32.3% |
| C | 88100 | 16.0% |
| L | 87132 | 15.9% |
| D | 5342 | 1.0% |
| V | 3277 | 0.6% |
| U | 3277 | 0.6% |
| T | 3158 | 0.6% |
| R | 968 | 0.2% |
| X | 34 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 549276 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 180539 | 32.9% |
| E | 177432 | 32.3% |
| C | 88100 | 16.0% |
| L | 87132 | 15.9% |
| D | 5342 | 1.0% |
| V | 3277 | 0.6% |
| U | 3277 | 0.6% |
| T | 3158 | 0.6% |
| R | 968 | 0.2% |
| X | 34 | < 0.1% |